Unconfigured Ad

Collapse
X
 
  • Filter
  • Time
  • Show
Clear All
new posts
  • arcolombo698
    Senior Member
    • Nov 2013
    • 142

    Understanding Trimmomatic Output

    Hello.

    So for a specific trimmomatic run, I have the following output and I wish to compare this information with FastQC

    Input Read Pairs: 39100817 Both Surviving: 15397090 (39.38%) Forward Only Surviving: 12799520 (32.73%) Reverse Only Surviving: 548505 (1.40%) Dropped: 10355702 (26.48%)


    where my FastQC report shows that my total input reads is
    39100817, and after trimming the QC report shows 15397090

    So if I take the difference between these I get 23703727

    however this does not match the Dropped 10355702 from trimmomatic.

    So what is this Dropped output mean?
  • usad
    Member
    • Sep 2009
    • 53

    #2
    Hi Acrolombo

    Trimmomatic will (in this paired case) generated four files from your set of two files (FW & RV)
    First a new pair of FW and RV reads which are still paired. This is indicated by "both surviving" and this is the same numer you got out of FastQC.
    In addition to this paired sequences, some of your reads in FW did survive the trimming but their partner sequences didn't, these end up in a different file. Complementary some of the RV reads might survive, but their cognate FW read might have been trimmed down completely. This is the last file RV only.
    As most often RV reads have lower Quality you get more reads where the FW read survived than the RV read than vice versa.

    You might ask why: The issue is that if you were to drop some FW reads completely but kept their RV reads your FW.fastq file would contain a different number of reads than the RV.fastq file. As their is still useful information in unpaired reads for many aplications these are being moved to separate files.

    Cheers
    björn

    Comment

    Latest Articles

    Collapse

    • SEQadmin2
      Nine Things a Sample Prep Scientist Thinks About Before Sequencing
      by SEQadmin2


      I’m not a sequencing expert. I’m a purification scientist who uses NGS to evaluate workflows my group develops. With this perspective, we think about the sample first and the NGS workflow second. The sequencer is an exceptionally honest reporter, but it can only report on what you give it, so whether you get clean, interpretable data from an NGS workflow is largely determined before you begin.


      Here are nine questions we think about, in roughly the order they matter, before...
      06-18-2026, 07:11 AM
    • SEQadmin2
      From Collection to Sequencing: Why Sample Preparation and Preservation Define Sequencing Data
      by SEQadmin2


      Data variability is still an issue in sequencing technologies despite the advances in reproducibility and accuracy of these platforms. But the problem does not originate in the sequencing itself, but in the previous steps, before the sample reaches the sequencer.


      The first step is collection, followed by preservation and sample preparation for analysis. Most scientists overlook those steps, but not being careful might just be skewing the experiment’s results.
      ...
      06-02-2026, 10:05 AM

    ad_right_rmr

    Collapse

    News

    Collapse

    Topics Statistics Last Post
    Started by SEQadmin2, 06-17-2026, 06:09 AM
    0 responses
    30 views
    0 reactions
    Last Post SEQadmin2  
    Started by SEQadmin2, 06-09-2026, 11:58 AM
    0 responses
    96 views
    0 reactions
    Last Post SEQadmin2  
    Started by SEQadmin2, 06-05-2026, 10:09 AM
    0 responses
    115 views
    0 reactions
    Last Post SEQadmin2  
    Started by SEQadmin2, 06-04-2026, 08:59 AM
    0 responses
    109 views
    0 reactions
    Last Post SEQadmin2  
    Working...