Unconfigured Ad

**adamdeluca** · 08-17-2010, 04:12 AM

It all depends on your application, because this is a tradeoff between the quality of the alignment and the number of reads aligned. Figure between 30 and 40 per mismatch.

**mrawlins** · 08-17-2010, 07:46 AM

For RNA-Seq the desired output is usually a read count, so the reads only have to be of sufficient quality to map to the right location. The value for e in those applications can be 300+, depending on read length, without sacrificing quality of results.
For SNP calling the quality of the reads is more important than the quantity, so a much lower -e is useful. For longer reads (80 bases) I wouldn't do anything lower than 100.

I generally have used this method to figure out how to set -e.
How many of the bases not covered in the seed would I tolerate being wrong, assuming they are high-quality bases. I take that number times 30 to set -e. If you don't care about what comes after the seed, take the number of non-seed bases and multiply by 30.
Larger values for -e seem to slow bowtie down.

**fkrueger** · 08-17-2010, 07:55 AM

Quality values get rounded to a the nearest 10, which means reads will be rejected if you have 3 high quality mismatches (it saturates at 30) in your mismatch. If the basecall quality is quite bad however, you can easily end up with 10 or 15 low scoring mismatches.

As adamdeluca mentioned already there might be applications where it is worth increasing the limit (e.g. many high quality SNPs if you are sequencing another strain). Increasing -e does increase the alignment time considerably however.

It might be worth performing some quality control on the data to see if the error rates start to increase drastically towards later cycles (e.g. with fastqc), and if so you might just trim all sequences to a cycle where you do still trust the basecalls before running bowtie.

Topics	Statistics	Last Post
Large-Scale Protein Screen Uncovers Hidden Regulators of Alternative Polyadenylation by SEQadmin2 Started by SEQadmin2, 06-26-2026, 11:10 AM	0 responses 16 views 0 reactions	Last Post by SEQadmin2 06-26-2026, 11:10 AM
Whole-Genome Sequencing Traces Faroe Islands Ancestry to a North Atlantic Founder Population by SEQadmin2 Started by SEQadmin2, 06-17-2026, 06:09 AM	0 responses 49 views 0 reactions	Last Post by SEQadmin2 06-17-2026, 06:09 AM
Sequencing the Two-Toed Sloth Genome Reveals Jumping Genes Tied to Its Extreme Metabolism by SEQadmin2 Started by SEQadmin2, 06-09-2026, 11:58 AM	0 responses 108 views 0 reactions	Last Post by SEQadmin2 06-09-2026, 11:58 AM
A New Method Makes Hantavirus Genome Analysis Faster and More Accessible by SEQadmin2 Started by SEQadmin2, 06-05-2026, 10:09 AM	0 responses 125 views 0 reactions	Last Post by SEQadmin2 06-05-2026, 10:09 AM

Unconfigured Ad

question about bowtie -e parameter

Comment

Comment

Comment

Latest Articles

ad_right_rmr

News