**GenoMax** · 01-15-2014, 09:46 AM

The first part could be explained by adapter/primer dimers without any insert.

As for trimming, give Trimmomatic (http://www.usadellab.org/cms/?page=trimmomatic), cutadapt (http://code.google.com/p/cutadapt/), or Trim Galore (http://www.bioinformatics.babraham.a...s/trim_galore/) a try. A recent comparison of trimmers is available at http://www.plosone.org/article/info:...l.pone.0085024.

**clintp** · 01-15-2014, 11:13 AM

I like the idea of trimmomatic, but I can't seem to make it trim the adapters--they still show up after the following:

Code:

java -classpath /opt/Trimmomatic-0.32/trimmomatic-0.32.jar org.usadellab.trimmomatic.TrimmomaticPE -threads 8 -trimlog TT.log Pool1_S1_L001_R1_001.fastq Pool1_S1_L001_R2_001.fastq p1r1_TT.fastq p1r1_To.fastq p1r2_TT.fastq p1r2_To.fastq LEADING:3 TRAILING:3 ILLUMINACLIP:adapter_13.fa:2:30:10 SLIDINGWINDOW:4:15 MINLEN:16

I may have the parameters set funny, but I don't know the best way to set them. My adapter sequence is the 13 bp common Illumina sequence--is 13 bp not scoring high enough to get trimmed?

**GenoMax** · 01-15-2014, 11:50 AM

Are you using the raw data for trimming? Why not use the TruSeq3 (PE) adapters that Trimmomatic includes (you will find those files in "Trimmomatic-0.30/adapters/") for the ILLUMINACLIP input?
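A minimal sketch of that suggestion, reusing the paired-end command posted earlier in the thread and swapping only the adapter file. The install directory is assumed from the classpath in that command, the file name TruSeq3-PE.fa is assumed to be the bundled paired-end TruSeq3 file, and the helper names `illuminaclip_step` and `run_trimmomatic_pe` are hypothetical. Whether TruSeq3 is the right adapter set for this library is also an assumption.

```shell
# Assumed install location (taken from the classpath used earlier in the thread).
TRIMMOMATIC_DIR=/opt/Trimmomatic-0.32
# Assumed name of the bundled paired-end TruSeq3 adapter file.
ADAPTERS="$TRIMMOMATIC_DIR/adapters/TruSeq3-PE.fa"

# Build the ILLUMINACLIP step:
#   <adapter fasta>:<seed mismatches>:<palindrome clip threshold>:<simple clip threshold>
illuminaclip_step() {
    printf 'ILLUMINACLIP:%s:%s:%s:%s' "$1" "$2" "$3" "$4"
}

# Same invocation as the original post, with only the adapter file changed.
run_trimmomatic_pe() {
    java -classpath "$TRIMMOMATIC_DIR/trimmomatic-0.32.jar" \
        org.usadellab.trimmomatic.TrimmomaticPE -threads 8 -trimlog TT.log \
        Pool1_S1_L001_R1_001.fastq Pool1_S1_L001_R2_001.fastq \
        p1r1_TT.fastq p1r1_To.fastq p1r2_TT.fastq p1r2_To.fastq \
        LEADING:3 TRAILING:3 "$(illuminaclip_step "$ADAPTERS" 2 30 10)" \
        SLIDINGWINDOW:4:15 MINLEN:16
}

# run_trimmomatic_pe   # uncomment once the paths match your system
```

The bundled adapter sequences are longer than 13 bases, so the 2:30:10 thresholds can stay as they were in this sketch.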

**mastal** · 01-15-2014, 12:22 PM

@clintp

I think the thresholds you are using for the ILLUMINACLIP step (2:30:10) are too high for Trimmomatic to recognize a match to a 13-base adapter sequence.

You need to either lower the values or use a longer adapter sequence.

See the Trimmomatic web page (http://www.usadellab.org/cms/?page=trimmomatic), particularly the discussion under the heading 'Adapter Fasta', from which I have extracted this quote:

'The thresholds used are a simplified log-likelihood approach. Each matching base adds just over 0.6, while each mismatch reduces the alignment score by Q/10. Therefore, a perfect match of a 12 base sequence will score just over 7'
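To make the arithmetic in that quote concrete, here is a rough sketch that approximates the score of a perfect (mismatch-free) match as 0.6 per base. The 0.6 figure is a rounding of the "just over 0.6" in the quote, so real scores will be slightly higher, and the function names are hypothetical.

```shell
# Approximate score of a perfect adapter match, assuming ~0.6 per matching base.
perfect_match_score() {
    # $1 = number of matching bases
    awk -v n="$1" 'BEGIN { printf "%.1f\n", n * 0.6 }'
}

# Smallest number of perfectly matching bases whose approximate score
# reaches a given simple clip threshold.
min_bases_for_threshold() {
    # $1 = simple clip threshold (third number in ILLUMINACLIP:<fasta>:a:b:c)
    awk -v t="$1" 'BEGIN { n = 0; while (n * 0.6 < t) n++; print n }'
}

perfect_match_score 13        # prints 7.8
min_bases_for_threshold 10    # prints 17
```

Under this approximation a 13-base adapter tops out near 7.8, below the simple clip threshold of 10 in 2:30:10, which needs roughly 17 matching bases. Either a lower third value (7, for instance, which leaves no room for mismatches) or a longer adapter sequence closes that gap.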

**clintp** · 01-15-2014, 01:38 PM

@mastal
Yep, understanding the cutoff scores helped a lot (durrr). Somehow I missed that discussion on the trimmomatic page.

@GenoMax
Thanks for that reference--very useful. It's too bad they didn't include ea-utils/FastqMcf in that analysis, though.

Miseq:Trimming, and sequencing primers at the beginning of a read
