**nucacidhunter** · 05-27-2017, 07:15 PM

I agree with you that %duplicate should increase with deeper sequencing. Read length differences are less likely to be the cause, as FastQC uses only the first 50 bases of a subset of reads for its duplicate calculation.
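To make the prefix-and-subset idea concrete, here is a minimal Python sketch of that kind of estimate. It is not FastQC's actual code: the function name `estimate_duplication` and the parameters `prefix_len` and `sample_size` are hypothetical, and their defaults are illustrative assumptions.

```python
from collections import Counter


def estimate_duplication(reads, prefix_len=50, sample_size=100_000):
    """Rough % of duplicate reads, judged on read prefixes only.

    Assumption (not FastQC's real implementation): only sequences first
    seen among the first `sample_size` reads are tracked, and every read
    is compared on its first `prefix_len` bases.
    """
    counts = Counter()
    for i, read in enumerate(reads):
        key = read[:prefix_len]
        if key in counts:
            # Already tracked: count it wherever it occurs in the file.
            counts[key] += 1
        elif i < sample_size:
            # New sequences are only added while inside the sampled subset.
            counts[key] = 1
    total = sum(counts.values())
    if total == 0:
        return 0.0
    # Every occurrence beyond the first of a tracked sequence is a duplicate.
    return 100.0 * (total - len(counts)) / total


# Two reads that differ only after base 50 count as duplicates of each other.
reads = ["A" * 60 + "C", "A" * 60 + "G", "T" * 60]
print(estimate_duplication(reads))
```

Because only the prefix is compared, differences in total read length between two runs should not change the estimate much, which is the point made above.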

It would be helpful if you could post the whole FastQC report for both runs as other plots might give some clues about the cause.

**Jaeb** · 05-28-2017, 12:55 AM

Thanks for your fast reply. I have now attached the complete FastQC reports....

**nucacidhunter** · 05-28-2017, 03:21 AM

I think the HS4000 reads contain a lot of errors due to positional lower quality, so the sequences of duplicates do not match and they are reported as unique reads. Also, many reads seem to have very low quality over their whole length. If you trim or filter low-quality reads, you should get a similar duplication rate for both runs.
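A toy Python example of this effect, using made-up reads and quality scores (the helper names and the mean-quality cutoff of 20 are assumptions for illustration, not settings from either run): copies of one fragment that carry miscalled bases look unique under exact matching, and filtering the low-quality reads raises the measured duplication rate.

```python
def exact_duplicate_rate(reads):
    """% of reads whose sequence is an exact copy of another read's sequence."""
    seqs = [seq for seq, _ in reads]
    if not seqs:
        return 0.0
    return 100.0 * (len(seqs) - len(set(seqs))) / len(seqs)


def filter_low_quality(reads, min_mean_q=20):
    """Keep reads whose mean base quality is at least min_mean_q (assumed cutoff)."""
    return [(seq, quals) for seq, quals in reads
            if sum(quals) / len(quals) >= min_mean_q]


template = "ACGTACGTAC"
# Four clean, high-quality copies of the same fragment.
clean = [(template, [35] * 10) for _ in range(4)]
# Four more copies of the same fragment, each with one miscalled base
# at a different position and low quality throughout.
noisy = [(template[:i] + "N" + template[i + 1:], [10] * 10) for i in range(4)]
reads = clean + noisy

print(exact_duplicate_rate(reads))                      # 37.5
print(exact_duplicate_rate(filter_low_quality(reads)))  # 75.0
```

All eight reads come from one fragment, but exact matching only finds the clean copies until the error-laden reads are removed.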

**GenoMax** · 05-28-2017, 04:08 AM

I suggest that you run clumpify.sh from BBMap to get an exact idea of the duplication. You can allow for errors when doing the sequence match. FastQC does not look at the entire dataset for some of its modules (only a fraction of the data is sampled).
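As an illustration of what allowing for errors in the sequence match means, here is a naive Python sketch. It is not how clumpify.sh works internally: it is a quadratic-time greedy comparison that is only practical for small inputs, and `count_duplicates` and `max_mismatches` are hypothetical names.

```python
def hamming(a, b):
    """Number of mismatching positions between two equal-length sequences."""
    if len(a) != len(b):
        raise ValueError("sequences must have equal length")
    return sum(x != y for x, y in zip(a, b))


def count_duplicates(reads, max_mismatches=0):
    """Count reads within max_mismatches substitutions of an earlier kept read.

    Greedy and O(n^2): each read is compared against every representative
    kept so far, and becomes a new representative if none is close enough.
    """
    representatives = []
    duplicates = 0
    for read in reads:
        if any(len(rep) == len(read) and hamming(rep, read) <= max_mismatches
               for rep in representatives):
            duplicates += 1
        else:
            representatives.append(read)
    return duplicates


reads = ["ACGTACGT", "ACGTACGA", "ACGAACGT", "TTTTTTTT"]
print(count_duplicates(reads, max_mismatches=0))  # 0: no exact copies
print(count_duplicates(reads, max_mismatches=1))  # 2: two reads are one base off
```

With exact matching the three related reads all look unique. Tolerating one substitution collapses them, which is why an error-tolerant tool can report a higher duplication rate on a noisy run.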

Even though there is a thread for clumpify here, the one over at Biostars has the directions clearly laid out on one page.

0% duplicates in RNA-Seq/Drop-seq library
