Why should we remove low quality sequences initially, while we have the option to remove the low quality variants at the final stage from the vcf file?
Seqanswers Leaderboard Ad
Collapse
Announcement
Collapse
No announcement yet.
X
-
Hi,
Like syfo mentioned, read trimming based on quality can be beneficial. However, with Illumina reads you would generally expect very good quality (except at the end of the read). So it might not need a separate read trimming step in your analysis.
One thing you could anyways do is trim reads while aligning them. If you plan on using BWA for alignment, you can use the bwa aln "-q" parameter. This can easily be done for paired end data as explained in the BWA manual.
To understand what the "q" parameter does, you could read the following post:
Discussion of next-gen sequencing related bioinformatics: resources, algorithms, open source efforts, etc
Hopefully this way, you can bypass that separate read trimming phase and this should also answer the concern of improper read mapping and thus potential false positives in variant calls.
I hope this helps.
Praful
Comment
-
I am not sure but I think it should.
Technically though, I'll be a little worried about samples whose reads quality just tanks midway and spend a little more time trouble shooting what might have gone wrong that you come across such a scenario.
Did you happen to use FastQC to see how the sequencing runs look? If so, can you share the "per base quality" graph for any of the samples where the read quality goes down?
Praful
Comment
-
Hi,
These don't look good at all. I don't know what might have caused this, but I think before thinking about trimming reads you should talk to someone at Illumina and also people who did the sequencing to try and figure what might have gone wrong. You could also post this on seqanswers as a separate blog and see if people have any suggestions.
Comment
-
Hi members,
Following up from the original title.. I have a question relating to mapping/alignment...which has to do with trimming as well..
The question is during mapping we are able to specify which phred score to choose from.. therefore am I correct to say that the low phred score nucleotides will not be mapped back to the reference sequence?? Then we do not need to trim the reads (assuming the low quality nucleotides are right at the end / beginning of a read [makes it easier to trim])
I think this is not true as I can see the low quality nucleotides in IGV, therefore the following question would be.. what is the use of specifying the phred quality score during mapping? if the low quality nucleotides are also being mapped??
Many thanks
Comment
Latest Articles
Collapse
-
by seqadmin
Next-generation sequencing (NGS) and quantitative polymerase chain reaction (qPCR) are essential techniques for investigating the genome, transcriptome, and epigenome. In many cases, choosing the appropriate technique is straightforward, but in others, it can be more challenging to determine the most effective option. A simple distinction is that smaller, more focused projects are typically better suited for qPCR, while larger, more complex datasets benefit from NGS. However,...-
Channel: Articles
10-18-2024, 07:11 AM -
ad_right_rmr
Collapse
News
Collapse
Topics | Statistics | Last Post | ||
---|---|---|---|---|
Started by seqadmin, 11-01-2024, 06:09 AM
|
0 responses
24 views
0 likes
|
Last Post
by seqadmin
11-01-2024, 06:09 AM
|
||
New Model Aims to Explain Polygenic Diseases by Connecting Genomic Mutations and Regulatory Networks
by seqadmin
Started by seqadmin, 10-30-2024, 05:31 AM
|
0 responses
21 views
0 likes
|
Last Post
by seqadmin
10-30-2024, 05:31 AM
|
||
Started by seqadmin, 10-24-2024, 06:58 AM
|
0 responses
25 views
0 likes
|
Last Post
by seqadmin
10-24-2024, 06:58 AM
|
||
New AI Model Designs Synthetic DNA Switches for Targeted Gene Expression in Specific Cell Types
by seqadmin
Started by seqadmin, 10-23-2024, 08:43 AM
|
0 responses
55 views
0 likes
|
Last Post
by seqadmin
10-23-2024, 08:43 AM
|
Comment