When I trim sequences for RNASeq, can I trim by sliding window on quality score or will the fact that the Illumina reads are now of different lengths affect the FPKM values (using cufflinks)? Should I just do a 3' and 5' trim of a specific length and leave all lengths equal?
Unconfigured Ad
Collapse
X
-
Not sure if there are any papers on this yet, but I don't think so.
I tried the window based trimmer ea-utils on my RNAseq datasets and realigned, getting extremely similar results to my untrimmed datasets.
I should mention this is on very short read data (35-40bp).
For longer reads with low quality ends this is of course more of an issue.
-
-
The point of trimming low-quality base-calls is to remove bases which might be erroneous. A naive aligner would be deterred from a valid mapping by mismatches that are only due to sequencing errors. A quality-aware aligner, however, knows that a mismatch of a low-quality base is no reason to reject an otherwise good mapping and will report it. Many of the currently popular aligners work this way, i.e., they pay less attention to the low-quality ends of reads. By trimming, you basically take away from the aligner the chance to make use of this feature. However, trimming makes a hard-cut decision (everything below a quality threshold is removed) while a well-designed quality-aware aligner may make a more sophisticated, gradual, decision.
Comment
-
-
Thanks for the answer- do you know which aligners work in this way? (I am working with bowtie and cufflinks)
The reason we were thinking of trimming is because we are working with mixed communities (co-cultures) so we may have two bacteria in the same Illumina. We then align all the data to each of two genomes, allowing zero mismatches.
Given this, would your answer still be not to trim?
And just for my general knowledge- do you know if different length reads if we did trim would affect the final FPKM calculations?
Thanks!
Comment
-
Latest Articles
Collapse
-
by SEQadmin2
Data variability is still an issue in sequencing technologies despite the advances in reproducibility and accuracy of these platforms. But the problem does not originate in the sequencing itself, but in the previous steps, before the sample reaches the sequencer.
The first step is collection, followed by preservation and sample preparation for analysis. Most scientists overlook those steps, but not being careful might just be skewing the experiment’s results.
...-
Channel: Articles
06-02-2026, 10:05 AM -
-
by SEQadmin2
With the launch of new single-cell sequencing platforms in 2026, the field stands at an exciting inflection point. This article surveys the most impactful advances in the field and discusses how they’re reshaping research in cancer, immunology, and beyond.
Introduction
Single-cell sequencing technologies have undergone remarkable advances over the past decade, transitioning from low-throughput experimental approaches to highly scalable platforms capable of...-
Channel: Articles
05-22-2026, 06:42 AM -
ad_right_rmr
Collapse
News
Collapse
| Topics | Statistics | Last Post | ||
|---|---|---|---|---|
|
Sequencing the Two-Toed Sloth Genome Reveals Jumping Genes Tied to Its Extreme Metabolism
by SEQadmin2
Started by SEQadmin2, 06-09-2026, 11:58 AM
|
0 responses
23 views
0 reactions
|
Last Post
by SEQadmin2
06-09-2026, 11:58 AM
|
||
|
Started by SEQadmin2, 06-05-2026, 10:09 AM
|
0 responses
29 views
0 reactions
|
Last Post
by SEQadmin2
06-05-2026, 10:09 AM
|
||
|
Started by SEQadmin2, 06-04-2026, 08:59 AM
|
0 responses
39 views
0 reactions
|
Last Post
by SEQadmin2
06-04-2026, 08:59 AM
|
||
|
Started by SEQadmin2, 06-02-2026, 12:03 PM
|
0 responses
61 views
0 reactions
|
Last Post
by SEQadmin2
06-02-2026, 12:03 PM
|
Comment