**lorendarith** · 12-01-2013, 10:09 AM

It could mean that the quality in read 2 was lower than in read 1, which is why you retain a lot of unpaired read 1 where the corresponding read 2 was dropped due to being trimmed too short or having very low quality.

**alpha2zee** · 12-01-2013, 11:02 AM

Originally posted by lorendarith

It could mean that the quality in read 2 was lower than in read 1, which is why you retain a lot of unpaired read 1 where the corresponding read 2 was dropped due to being trimmed too short or having very low quality.

This is a possibility. However, it seems unlikely in my case (I examined read qualities using FastQC).

I wonder if the asymmetry that I am seeing is because I am not using the keepBothReads option of trimmomatic. From the manual:

After read-though has been detected by palindrome mode, and the adapter sequence removed, the reverse read contains the same sequence information as the forward read, albeit in reverse complement. For this reason, the default behaviour is to entirely drop the reverse read. By specifying 'true' for this parameter, the reverse read will also be retained, which may be useful e.g. if the downstream tools cannot handle a combination of paired and unpaired reads.

**alpha2zee** · 12-01-2013, 11:53 AM

Originally posted by alpha2zee

I wonder if the asymmetry that I am seeing is because I am not using the keepBothReads option of trimmomatic.

It seems this is not the reason. I tested this with 'ILLUMINACLIP:TruSeq3-PE-2.fa:2:30:10:8:TRUE' (see my first post). Per the manual, keepBothReads is the last field of the step: ILLUMINACLIP:<fastaWithAdaptersEtc>:<seed mismatches>:<palindrome clip threshold>:<simple clip threshold>:<minAdapterLength>:<keepBothReads>.
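Because the fields are positional, it is easy to misplace one. The following is only a minimal sketch, assuming a POSIX shell: `describe_illuminaclip` is a hypothetical helper (not part of Trimmomatic) that labels each colon-separated field in the order the manual gives. It assumes the adapter file path itself contains no colon.

```shell
# Hypothetical helper, not part of Trimmomatic: label the colon-separated
# fields of an ILLUMINACLIP step in the order quoted from the manual.
# The body runs in a subshell, so its variables do not leak to the caller.
describe_illuminaclip() (
    step=$1
    # Split on ':'; fields that are absent are left empty by read.
    IFS=: read -r name fasta seed palindrome simple minlen keepboth <<EOF
$step
EOF
    if [ "$name" != "ILLUMINACLIP" ]; then
        echo "not an ILLUMINACLIP step: $step" >&2
        exit 1
    fi
    echo "fastaWithAdaptersEtc=$fasta"
    echo "seedMismatches=$seed"
    echo "palindromeClipThreshold=$palindrome"
    echo "simpleClipThreshold=$simple"
    echo "minAdapterLength=${minlen:-(not given)}"
    echo "keepBothReads=${keepboth:-(not given)}"
)

describe_illuminaclip 'ILLUMINACLIP:TruSeq3-PE-2.fa:2:30:10:8:TRUE'
```

For the step string tested in this post, the last line printed is `keepBothReads=TRUE`. A step that stops after the simple clip threshold reports both optional fields as not given.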

**alpha2zee** · 12-01-2013, 05:17 PM

I tweaked the Trimmomatic run parameters a bit and it seems to have a significant effect: there are fewer unpaired reads, and the asymmetry between the left and right unpaired reads is smaller as well.

Code:

java -jar trimmomatic-0.32.jar PE -threads 16 -phred33 sample_1.fastq sample_2.fastq sample_trimmed_paired_1.fastq.gz sample_trimmed_unpaired_1.fastq.gz sample_trimmed_paired_2.fastq.gz sample_trimmed_unpaired_2.fastq.gz ILLUMINACLIP:adapters/TruSeq3-PE-2.fa:2:30:10:8:TRUE LEADING:3 TRAILING:3 SLIDINGWINDOW:4:15 MINLEN:26
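To put numbers on the asymmetry, one option is to count the reads in each of the four output files. This is a minimal sketch, assuming a POSIX shell with `gzip` available and standard four-line FASTQ records (no wrapped sequence lines). `count_fastq_reads` is a hypothetical helper, and the file names are the ones from the command above.

```shell
# Hypothetical helper: count FASTQ records in a plain or gzipped file,
# assuming exactly four lines per record.
count_fastq_reads() (
    file=$1
    case "$file" in
        *.gz) lines=$(gzip -dc "$file" | wc -l) ;;
        *)    lines=$(wc -l < "$file") ;;
    esac
    echo $((lines / 4))
)

# Report the read count of each Trimmomatic output that exists.
for f in sample_trimmed_paired_1.fastq.gz sample_trimmed_unpaired_1.fastq.gz \
         sample_trimmed_paired_2.fastq.gz sample_trimmed_unpaired_2.fastq.gz; do
    if [ -f "$f" ]; then
        printf '%s\t%s\n' "$f" "$(count_fastq_reads "$f")"
    fi
done
```

The two paired files should report the same count. A large gap between the two unpaired counts is the asymmetry discussed in this thread.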

**mastal** · 12-02-2013, 04:44 AM

Yes, it looks like the difference in size of your R1_unpaired file is due to changing the parameters so that Trimmomatic keeps both reads after trimming adapter sequences in palindrome mode, rather than following the default behaviour, which is to discard R2 and thus leave R1 unpaired.

**annaprotasio** · 11-19-2014, 05:57 AM

Hi,
I just wanted to say that this thread really helped me sort out my Trimmomatic call.

I have small RNA libraries and, due to their nature, a big proportion of each read is adapter. I was running the default mode and was quite unhappy with the results.

Code:

java -Xmx1000m -jar ./Trimmomatic-0.32/trimmomatic-0.32.jar PE -threads 4 1.fastq.gz 2.fastq.gz out_1.fq out.unpaired_1.fq out_2.fq out.unpaired_2.fq ILLUMINACLIP:miRNA.neb.solexa.adapters.fasta:2:10:7 MINLEN:15

The file sizes for the output files were quite discouraging:

102M Nov 7 10:53 1.out.unpaired_2.fq
56K Nov 7 10:53 1.out.unpaired_1.fq
56M Nov 7 10:53 1.out_2.fq
52M Nov 7 10:53 1.out_1.fq

After considering the suggested changes in this thread, my new call is:

Code:

java -Xmx1000m -jar ./Trimmomatic-0.32/trimmomatic-0.32.jar PE -threads 4 1.fastq.gz 2.fastq.gz out_1.fq out.unpaired_1.fq out_2.fq out.unpaired_2.fq ILLUMINACLIP:miRNA.neb.solexa.adapters.fasta:2:30:10:8:TRUE LEADING:3 TRAILING:3 SLIDINGWINDOW:4:30 MINLEN:15

And the files' sizes look much better:

13M Nov 19 13:41 2.out.unpaired_2.fq
32M Nov 19 13:41 2.out.unpaired_1.fq
484M Nov 19 13:41 2.out_2.fq
517M Nov 19 13:41 2.out_1.fq

PS: I am aware that PE is overkill for miRNAs, but SE was not available to us at the time.

Asymmetric trimmomatic output with paired-end RNA seq. data