quick question: I heard anecdotal reports that bowtie misses a large number of matches compare to programs like Maq. have anyone else heard this? in terms of %, does anyone have any numbers?
Unconfigured Ad
Collapse
X
-
Hi Doxologist,
The circumstances under which Bowtie might miss alignments that are "valid" according to its alignment policy are outlined in the manual (see last paragraph of section "Maq-like Policy"). These misses only occur in -n 2 and -n 3 modes, and they can be avoided by increasing the --maxbts parameter (at the cost of some speed). Unless your read data is very low quality, the fraction of reads missed due to the backtracking limit in -n 2 mode is generally very small (<1%).
Note that when you run 'maq' with -n 2 option (the default), it will find some alignments that actually have 3 mismatches in the seed. Bowtie will *not* report alignments with 3 mismatches in the seed unless -n 3 is specified. It's likely that this is the source of the difference that the anecdotal reports are referring to.
Thanks,
Ben
Comment
-
-
I haven't spent much time looking at BWA and SOAP2, so I'm not the expert. I like Bowtie :-), but I know others who use and are happy with BWA. I haven't met anyone with much experience with SOAP2. There is a post earlier in this thread by Heng Li that talks a little about the key differences among the three. I heard that Heng Li also has some benchmarking results that he's made public somewhere...
Thanks,
Ben
Comment
-
-
Does anyone know whether bowtie supports aligning multiple read lengths?
I am doing small RNA Solexa sequencing and so after the adapter has been removed I end up with variable length reads. With MAQ it appears that you have to run it multiple times for the different lengths, is bowtie the same?
Comment
-
-
Yes it does!Originally posted by danielsbrewer View PostDoes anyone know whether bowtie supports aligning multiple read lengths?
I am doing small RNA Solexa sequencing and so after the adapter has been removed I end up with variable length reads. With MAQ it appears that you have to run it multiple times for the different lengths, is bowtie the same?
-Adam
Comment
-
-
we had a similar discussion in another thread: http://seqanswers.com/forums/showthread.php?p=3505Originally posted by danielsbrewer View PostDoes anyone know whether bowtie supports aligning multiple read lengths?
I am doing small RNA Solexa sequencing and so after the adapter has been removed I end up with variable length reads. With MAQ it appears that you have to run it multiple times for the different lengths, is bowtie the same?
Basically, ELAND doesn't allow for different lengths and Bowtie does.
Comment
-
-
bowtie error
I am just starting out with bowtie and I am getting the following error:
Has anyone seen anything similar or know what its actually saying. I am pretty sure that the smallest read size is something like 10.Code:$ ./bowtie -p 4 -t h_sapiens ../GDB1.fastq GDB1.map Time loading forward index: 00:00:01 Time loading mirror index: 00:00:02 Error: Read (Error: Read (Error: Read (Error: Read (HHHHWWWWIIII----EEEEAAAASSSS222266669999BBBB::::5555::::1464::::711344088362:84:1::1818164431573) is less than 2 characters long7) is less than 2 characters long) is less than 2 characters long) is less than 2 characters long
Comment
-
-
Is Bowtie suitable for miRNA detection
I am just playing around with bowtie along with other software (maq,novoalign) and was wondering whether bowtie is suitable for use with an miRNA detection experiment. In a previous post Ben states that:
That hints to me that the default options might not be the best for experiments to compare miRNAs between samples. Does anyone have an opinion as to what the best options to use are?Originally posted by Ben Langmead View PostFirst, let me reemphasize that I think of Bowtie's target application as mammalian resequencing - that's how I characterize it in the manual and that's what we spend our time trying to optimize it for.
I would think that you want to know all the alignments for each read above a certain quality threshold. At the moment I am thinking of using "--best -k 100", as if there is more that 100 hits then it is probably not a "real" alignement.
Any thoughts?
Comment
-
-
Hi Ben, Just to say bowtie is great work. Far outstrips any pipeline we have used previously!
One question though - is there a way to output reads with multiple hits to a separate file? We work on repetitive regions and with a little massaging, this data may still be useful to us.
Comment
-
-
-
Yes - if you specify -k > 1 or -a, Bowtie will output the appropriate number of hits per read for reads with >=1 hit. If -m <int> is also specified, Bowtie will output no alignments for reads with > <int> alignments and, if --maxfa/--maxfq is specified, will dump those reads (the reads, not the alignments) to the specified file. For reads with <= <int> alignments, Bowtie behaves the same as if -m were not specified.
I hope that helps.
Comment
-
Latest Articles
Collapse
-
by SEQadmin2
Data variability is still an issue in sequencing technologies despite the advances in reproducibility and accuracy of these platforms. But the problem does not originate in the sequencing itself, but in the previous steps, before the sample reaches the sequencer.
The first step is collection, followed by preservation and sample preparation for analysis. Most scientists overlook those steps, but not being careful might just be skewing the experiment’s results.
...-
Channel: Articles
06-02-2026, 10:05 AM -
-
by SEQadmin2
With the launch of new single-cell sequencing platforms in 2026, the field stands at an exciting inflection point. This article surveys the most impactful advances in the field and discusses how they’re reshaping research in cancer, immunology, and beyond.
Introduction
Single-cell sequencing technologies have undergone remarkable advances over the past decade, transitioning from low-throughput experimental approaches to highly scalable platforms capable of...-
Channel: Articles
05-22-2026, 06:42 AM -
-
by SEQadmin2
Studying ecosystems means dealing with complex, multi-species communities that are hard to observe at scale. This complexity, however, hides many important questions to be answered, from how biogeochemical cycles work and how climate change can affect species distribution to how conservation strategies can work best.
Genomics, particularly since the expansion of NGS, has transformed ecosystem ecology. By sequencing environmental DNA, we can now assess biodiversity without direct...-
Channel: Articles
05-06-2026, 09:04 AM -
ad_right_rmr
Collapse
News
Collapse
| Topics | Statistics | Last Post | ||
|---|---|---|---|---|
|
Started by SEQadmin2, Today, 08:59 AM
|
0 responses
9 views
0 reactions
|
Last Post
by SEQadmin2
Today, 08:59 AM
|
||
|
Started by SEQadmin2, 06-02-2026, 12:03 PM
|
0 responses
21 views
0 reactions
|
Last Post
by SEQadmin2
06-02-2026, 12:03 PM
|
||
|
Started by SEQadmin2, 06-02-2026, 11:40 AM
|
0 responses
17 views
0 reactions
|
Last Post
by SEQadmin2
06-02-2026, 11:40 AM
|
||
|
Started by SEQadmin2, 05-28-2026, 11:40 AM
|
0 responses
30 views
0 reactions
|
Last Post
by SEQadmin2
05-28-2026, 11:40 AM
|

Comment