Seqanswers Leaderboard Ad

Collapse

Announcement

Collapse
No announcement yet.
X
 
  • Filter
  • Time
  • Show
Clear All
new posts

  • Transcripts from RNA-seq assembly

    I assembled the transcriptome of an unsequenced organism using Trinity. I would like to do some downstream analysis on the assembled transcripts. When I look at the transcripts and translate them, I find they are littered with stop codons, does that indicate there is some issue with sequencing? How can one go from the transcript to the amino acid sequence? Can we assume that the beginning of the transcript would be the beginning of the transcription start site for the gene? I would love any pointers on this.

    Thanks,

  • #2
    Can we assume that the beginning of the transcript would be the beginning of the transcription start site for the gene?
    This is unlikely. Trinity assembles the whole RNA, including untranslated regions. If you want to look at protein predictions de-novo, then you need something like frameDP to correct for frame shift errors and identify the most likely start/stop sites for translation.

    Comment


    • #3
      Are you sure you were in the right reading frame when checking for stop codons? Look for start codons to fix the reading frame.

      Can we assume that the beginning of the transcript would be the beginning of the transcription start site for the gene?
      I would say, possibly yes, if you meant to say transcription start site. But gringer might have been right in assuming that you meant the translation start.

      Comment


      • #4
        Originally posted by Simon Anders View Post
        I would say, possibly yes, if you meant to say transcription start site. But gringer might have been right in assuming that you meant the translation start.
        Whoops, yes, I read that wrong. Just to add to this, depending on the mapping quality the start point may not necessarily be exactly at the transcript start location.

        Comment


        • #5
          Thanks. That clarifies some concepts. Yes, I had meant the Translation start site instead of transcription. Now my data makes more sense as well since very few of the assembled transcripts were starting with a start codon. Is FrameDP a good choice for finding peptide sequences? I saw it had only 10 citations to date. Are there any other tools for doing this?

          Thanks.

          Comment


          • #6
            Is FrameDP a good choice for finding peptide sequences? I saw it had only 10 citations to date. Are there any other tools for doing this?
            Have a look here for other options:
            Discussion of next-gen sequencing related bioinformatics: resources, algorithms, open source efforts, etc


            FrameDP generates huge amounts of files, something like 5-10 times the number of transcripts, so you need to be careful with that. My computer claimed to have run out of space (with ~300GB free) due to me not being careful enough running that program.

            Comment


            • #7
              I'm having the same problem just now. When I translate the assembled transcripts (in the 6 pssible reading frames), they are full of stop codons and I don't know what to do. I've been thinking about not normalizing the libraries when assembling in Trinity but I don't know if it make sense...

              Did you finally solve the problem?

              Thank you

              Comment

              Latest Articles

              Collapse

              • seqadmin
                Genetic Variation in Immunogenetics and Antibody Diversity
                by seqadmin



                The field of immunogenetics explores how genetic variations influence immune responses and susceptibility to disease. In a recent SEQanswers webinar, Oscar Rodriguez, Ph.D., Postdoctoral Researcher at the University of Louisville, and Ruben Martínez Barricarte, Ph.D., Assistant Professor of Medicine at Vanderbilt University, shared recent advancements in immunogenetics. This article discusses their research on genetic variation in antibody loci, antibody production processes,...
                11-06-2024, 07:24 PM
              • seqadmin
                Choosing Between NGS and qPCR
                by seqadmin



                Next-generation sequencing (NGS) and quantitative polymerase chain reaction (qPCR) are essential techniques for investigating the genome, transcriptome, and epigenome. In many cases, choosing the appropriate technique is straightforward, but in others, it can be more challenging to determine the most effective option. A simple distinction is that smaller, more focused projects are typically better suited for qPCR, while larger, more complex datasets benefit from NGS. However,...
                10-18-2024, 07:11 AM

              ad_right_rmr

              Collapse

              News

              Collapse

              Topics Statistics Last Post
              Started by seqadmin, Today, 11:09 AM
              0 responses
              24 views
              0 likes
              Last Post seqadmin  
              Started by seqadmin, Today, 06:13 AM
              0 responses
              20 views
              0 likes
              Last Post seqadmin  
              Started by seqadmin, 11-01-2024, 06:09 AM
              0 responses
              30 views
              0 likes
              Last Post seqadmin  
              Started by seqadmin, 10-30-2024, 05:31 AM
              0 responses
              21 views
              0 likes
              Last Post seqadmin  
              Working...
              X