  • Noa
    Member
    • Jun 2011
    • 62

    Quality trimming for RNA Seq

    When I trim sequences for RNA-Seq, can I trim with a sliding window on quality score, or will the fact that the Illumina reads then have different lengths affect the FPKM values (using Cufflinks)? Should I instead trim a fixed number of bases from the 3' and 5' ends so that all reads keep the same length?
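To make the two options in the question concrete, here is a minimal sketch of both trimming strategies. It is only an illustration, not the algorithm of any particular trimming tool: the function names, the window size of 4, the mean-quality cutoff of 20, and the Phred+33 quality encoding are all assumptions chosen for the example.

```python
def phred33_to_scores(quality_string):
    """Decode a FASTQ quality string into integer scores (assumes Phred+33)."""
    return [ord(ch) - 33 for ch in quality_string]


def sliding_window_trim(seq, quals, window=4, min_mean_q=20):
    """Cut the read at the start of the first window whose mean quality
    falls below min_mean_q. Reads shorter than the window are left as-is.

    Returns the kept prefix of the sequence and its quality scores,
    so reads generally end up with different lengths.
    """
    if len(seq) != len(quals):
        raise ValueError("sequence and quality lengths differ")
    for start in range(max(len(seq) - window + 1, 0)):
        win = quals[start:start + window]
        if sum(win) / len(win) < min_mean_q:
            return seq[:start], quals[:start]
    return seq, quals


def fixed_trim(seq, quals, trim5=0, trim3=0):
    """Remove a fixed number of bases from each end, so that reads of
    equal input length keep an equal output length."""
    end = len(seq) - trim3
    return seq[trim5:end], quals[trim5:end]


if __name__ == "__main__":
    read = "ACGTACGTAC"
    quals = phred33_to_scores("IIIIII####")  # six good bases, four poor ones
    print(sliding_window_trim(read, quals))
    print(fixed_trim(read, quals, trim5=2, trim3=3))
```

The sliding-window version adapts to each read, while the fixed version discards the same positions from every read regardless of their quality.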
  • colindaven
    Senior Member
    • Oct 2008
    • 417

    #2
    Not sure if there are any papers on this yet, but I don't think so.

    I tried the window-based trimmer from ea-utils on my RNA-Seq datasets and realigned, getting results extremely similar to those from my untrimmed datasets.
    I should mention that this was on very short read data (35-40 bp).

    For longer reads with low-quality ends, this is of course more of an issue.

    • Simon Anders
      Senior Member
      • Feb 2010
      • 995

      #3
      That is, unless the aligner takes base-call quality scores into account, in which case trimming should be unnecessary anyway.

      • Noa
        Member
        • Jun 2011
        • 62

        #4
        Can you please elaborate on that? Why would the trimming be unnecessary?

        • Simon Anders
          Senior Member
          • Feb 2010
          • 995

          #5
          The point of trimming low-quality base calls is to remove bases that might be erroneous. A naive aligner would be deterred from a valid mapping by mismatches that are due only to sequencing errors. A quality-aware aligner, however, knows that a mismatch at a low-quality base is no reason to reject an otherwise good mapping, and will report it. Many of the currently popular aligners work this way, i.e., they pay less attention to the low-quality ends of reads. By trimming, you basically take away the aligner's chance to make use of this feature. Moreover, trimming makes a hard-cut decision (everything below a quality threshold is removed), while a well-designed quality-aware aligner may make a more sophisticated, gradual decision.
          Last edited by Simon Anders; 03-01-2012, 09:57 AM. Reason: reworded
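The difference between a naive and a quality-aware acceptance rule can be sketched in a few lines. This is a toy model for ungapped comparison of a read against a reference window of the same length, not the scoring scheme of any specific aligner: the function names and the penalty threshold of 30 are assumptions made for the example. The quality-aware rule sums the Phred scores at mismatched positions, so a mismatch at a low-quality base costs little and one at a high-quality base costs a lot.

```python
def mismatch_positions(read, ref):
    """Indices at which the read and the reference window differ."""
    return [i for i, (a, b) in enumerate(zip(read, ref)) if a != b]


def naive_accept(read, ref, max_mismatches=0):
    """Count every mismatch equally, ignoring base-call quality."""
    return len(mismatch_positions(read, ref)) <= max_mismatches


def quality_aware_accept(read, quals, ref, max_quality_sum=30):
    """Accept if the summed Phred quality at mismatched positions is small.

    max_quality_sum is a hypothetical threshold chosen for this sketch.
    """
    penalty = sum(quals[i] for i in mismatch_positions(read, ref))
    return penalty <= max_quality_sum


if __name__ == "__main__":
    ref = "ACGTACGTAC"

    # Mismatch at the last base, which has a poor quality score.
    read_low = "ACGTACGTAT"
    quals_low = [40] * 9 + [5]
    print(naive_accept(read_low, ref))                     # rejected
    print(quality_aware_accept(read_low, quals_low, ref))  # accepted

    # Mismatch at a confidently called base: likely a real difference.
    read_high = "ACCTACGTAC"
    quals_high = [40] * 10
    print(quality_aware_accept(read_high, quals_high, ref))  # rejected
```

A hard quality cutoff would treat a base just below the threshold and one far below it identically, whereas the summed penalty changes gradually with the score.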

          • Noa
            Member
            • Jun 2011
            • 62

            #6
            Thanks for the answer. Do you know which aligners work this way? (I am working with Bowtie and Cufflinks.)
            The reason we were thinking of trimming is that we are working with mixed communities (co-cultures), so we may have two bacteria in the same Illumina data set. We then align all the data to each of the two genomes, allowing zero mismatches.
            Given this, would your answer still be not to trim?
            And just for my general knowledge: do you know whether reads of different lengths, if we did trim, would affect the final FPKM calculations?
            Thanks!
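For reference on the FPKM question, here is the basic definition (fragments per kilobase of transcript per million mapped fragments) as a small sketch. The function name and the example numbers are made up for illustration. Cufflinks' actual estimator is more involved and is not reproduced here; this only shows that read length is not an explicit term in the basic formula, so in this simplified view trimming matters through how many fragments end up mapped.

```python
def fpkm(fragments_on_gene, gene_length_bp, total_mapped_fragments):
    """Basic FPKM: fragments per kilobase of transcript per million
    mapped fragments.

    FPKM = fragments_on_gene * 1e9 / (gene_length_bp * total_mapped_fragments)
    """
    if gene_length_bp <= 0 or total_mapped_fragments <= 0:
        raise ValueError("length and total fragment count must be positive")
    return fragments_on_gene * 1e9 / (gene_length_bp * total_mapped_fragments)


if __name__ == "__main__":
    # Hypothetical numbers: 500 fragments on a 2,000 bp gene,
    # out of 10 million mapped fragments in total.
    print(fpkm(500, 2000, 10_000_000))

    # If trimming changed the mapped counts, the value would shift with them.
    print(fpkm(520, 2000, 10_400_000))
```

Because both the per-gene count and the total appear in the formula, a change in mapping rate that affects all genes proportionally largely cancels out, while a change that affects some genes more than others does not.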
