I'm also interested in how MAQ assigns quality scores. Can you confirm what you meant by "Q0;Q10;Q20" in the MAQ tests? Is this a threshold on the quality score that MAQ gives the alignment (as opposed to the quality score of the read in the Fastq file)? If a read maps to multiple locations, MAQ reports one location at random and assigns a quality score of 0. Therefore the Q0 accuracy should be much less than if you had excluded these alignments. I think that this behavior is a bit strange; it would be less confusing if MAQ didn't report any matches for the non-uniquely mapping reads and instead reported the number of places that the read maps (the whole read, not just the first 25 bases).
Seqanswers Leaderboard Ad
Collapse
Announcement
Collapse
No announcement yet.
X
-
Q0;Q10;Q20 is threshold on the alignment quality score assigned by MAQ.
MAQ is initially designed for resequencing and keeping these repetitive reads is quite useful for the subsequent SNP calling with MAQ. This also helps CNV calling. I could understand that a lot of people do not want to see all these repetitive reads, but putting a threshold on mapping quality is very easy anyway. In addition, different people may want to set different different threshold.
As for the calculation of mapping quality, it just follows a very simple Bayesian procedure. You can calculate p(z|x,u) of read z mapped to u on the reference x. With Bayesian formula you get p(u|x,z). The mapping quality is -10log10(1-p(u|x,z)).
Comment
-
Mira
Hello everybody,
I am new to this group and couldnt resist myself to join this exiciting discussion
Has anybody heard of MIRA ?
There is this guy quitely working on another software tool for Next Gen assembly .
The USP of this tool is it can perform a true hybrid assembly SAnger+454 or 454+Solexa which I believe will solve the Next gen assembly issues.
Although the version 2.9.95 doesnt support SNP analysis yet but its compact.
This tool might be on slower side becuase it performs assembly iterative correcting errors on the way.
I hope somebody evaluates this new version becuase I dont have the much needed hardware to run this program.
regds,
Amit
Comment
-
Originally posted by lh3 View PostThe latest version, 0.6.6, can output ALL hits with 0- or 1-mismatch in the seed.
I was excited to see this feature added to MAQ in the latest version as much of my work is applied to RNA (hence, it is quantitative). This should (hopefully) allow me to reduce some biases introduced by losing reads which map to multiple locations. Now, I am wondering, how do I go about using this feature? Is there a new option when running maq map? Or mapview? I have been unable to find it in the manpage.
Thanks,
Ryan
FOLLOWUP:
I found out the answer to this in the latest doc provided with version 0.6.6 (not the version on the sourceforge page).
Usage: maq map [options] <out.map> <chr.bfa> <reads_1.bfq> [reads_2.bfq]
Options: -1 INT length of the first read (<64) [0]
-2 INT length of the second read (<64) [0]
-m FLOAT rate of difference between reads and references [0.001]
-e INT maximum allowed sum of qualities of mismatches [70]
-d FILE adapter sequence file [null]
-a INT max distance between two paired reads [250]
-n INT number of mismatches in the first 24bp [2]
-M c|g methylation alignment mode [null]
-u FILE dump unmapped and poorly aligned reads to FILE [null]
-H FILE dump multiple/all 01-mismatch hits to FILE [null]
-C INT max number of hits to output. >512 for all 01 hits. [250]
-s INT seed for random number generator [random]
-N record mismatch positions (max read length<=55)
-t trim all reads (usually not recommended)
-c match in the colorspaceLast edited by myrna; 05-05-2008, 02:00 PM.
Comment
-
Eland is going to be hard to beat. It has had a few years of hard work, optimisation and thought put into it by Anthony (SSAHA) Cox at Solexa. It was designed from day 1 for aligning far more reads than you currently get from a GAII, to a full human reference on a desktop computer.
Comment
-
SeqMap (http://biogibbs.stanford.edu/~jiangh/SeqMap/) - work like ELand, can do 3 or more bp mismatches and also insdel
Comment
-
Hi Heng!
Have you heard about SOCS (http://bioinformatics.oxfordjournals...tract/btn512v1, http://socs.biology.gatech.edu/)? In their article they say that "The overall algorithm is similar to that used by software tools developed for analysis of Illumina-Solexa data (Li et al, 2008; Smith et al, 2008)
I'm interested in alignment of SOLiD data and I'd like to know your opinion what to use.. Maq, Mosaik, SHRiMP, ZOOM or this new tool SOCS ..
Best regards,
Valentina
Comment
-
Where to find a recent benchmarking?
Hi lh3,
Thanks for the original benchmarking! I'm actually looking for such a benchmarking including the latest tools, like ZOOM!, bowtie, R Biostings pairwiseAlignment(), etc. Has anybody heard of that?
Cheers,
N.
Comment
Latest Articles
Collapse
-
by seqadmin
The field of immunogenetics explores how genetic variations influence immune responses and susceptibility to disease. In a recent SEQanswers webinar, Oscar Rodriguez, Ph.D., Postdoctoral Researcher at the University of Louisville, and Ruben Martínez Barricarte, Ph.D., Assistant Professor of Medicine at Vanderbilt University, shared recent advancements in immunogenetics. This article discusses their research on genetic variation in antibody loci, antibody production processes,...-
Channel: Articles
11-06-2024, 07:24 PM -
-
by seqadmin
Next-generation sequencing (NGS) and quantitative polymerase chain reaction (qPCR) are essential techniques for investigating the genome, transcriptome, and epigenome. In many cases, choosing the appropriate technique is straightforward, but in others, it can be more challenging to determine the most effective option. A simple distinction is that smaller, more focused projects are typically better suited for qPCR, while larger, more complex datasets benefit from NGS. However,...-
Channel: Articles
10-18-2024, 07:11 AM -
ad_right_rmr
Collapse
News
Collapse
Topics | Statistics | Last Post | ||
---|---|---|---|---|
Started by seqadmin, Today, 11:09 AM
|
0 responses
24 views
0 likes
|
Last Post
by seqadmin
Today, 11:09 AM
|
||
Started by seqadmin, Today, 06:13 AM
|
0 responses
20 views
0 likes
|
Last Post
by seqadmin
Today, 06:13 AM
|
||
Started by seqadmin, 11-01-2024, 06:09 AM
|
0 responses
30 views
0 likes
|
Last Post
by seqadmin
11-01-2024, 06:09 AM
|
||
New Model Aims to Explain Polygenic Diseases by Connecting Genomic Mutations and Regulatory Networks
by seqadmin
Started by seqadmin, 10-30-2024, 05:31 AM
|
0 responses
21 views
0 likes
|
Last Post
by seqadmin
10-30-2024, 05:31 AM
|
Comment