Bowtie, an ultrafast, memory-efficient, open source short read aligner

Ben Langmead replied

11-05-2009, 07:17 AM
Originally posted by liu3zhen View Post

Sorry for the confusing question.
If I specify -k 4 --best, multiple alignments will be reported even one best hit is found. I'm wondering is it possible only the best hit (only 1) is report for this case, rather than 4 alignments reported in best-to-worst order.

Please take a look at the documentation for the --strata option in the manual. If that doesn't do what you'd like, please post a small example where Bowtie won't report what you'd like.

Ben
Leave a comment:
liu3zhen replied

11-05-2009, 07:13 AM
Originally posted by Ben Langmead View Post

I don't follow your question; please post an example where Bowtie's output is not what you expect/want.

Ben

Sorry for the confusing question.
If I specify -k 4 --best, multiple alignments will be reported even one best hit is found. I'm wondering is it possible only the best hit (only 1) is report for this case, rather than 4 alignments reported in best-to-worst order.
Leave a comment:
Ben Langmead replied

11-05-2009, 07:03 AM
Originally posted by liu3zhen View Post

Thanks, Ben. Bother you again for more questions:
I think Bowtie is more like BLAST in term of output. The specified maximum alignmens were reported in the order of best-to-worst. Does Bowtie have a way to report only the "equal" best hits. If one best hit is found, only report one alignment. If multiple best hits are found, report up to the specified maximum number of alignments. This is important because I know which reads really hit multiple places in the genome by doing so.

I don't follow your question; please post an example where Bowtie's output is not what you expect/want.

Ben
Leave a comment:
liu3zhen replied

11-05-2009, 07:01 AM
Originally posted by Ben Langmead View Post

A subset (a subset of 1 in the case of -k 1) is chosen at random.

Thanks, Ben. Bother you again for more questions:
I think Bowtie is more like BLAST in term of output. The specified maximum alignmens were reported in the order of best-to-worst. Does Bowtie have a way to report only the "equal" best hits. If one best hit is found, only report one alignment. If multiple best hits are found, report up to the specified maximum number of alignments. This is important because I know which reads really hit multiple places in the genome by doing so.
Leave a comment:
Ben Langmead replied

11-05-2009, 06:01 AM
Originally posted by liu3zhen View Post

1. Is it possible some reads only have 2 hits even -k 4 and --best are specified?

Yes.

Originally posted by liu3zhen View Post

2. How does Bowtie deal with several "equal" best alignments but -k 1 --best were specified? This is the question for how Bowtie deal with repetitive reads.

A subset (a subset of 1 in the case of -k 1) is chosen at random.

Thanks,
Ben
Leave a comment:
liu3zhen replied

11-04-2009, 02:17 PM
Originally posted by Ben Langmead View Post

That's right; --best does not limit the number of alignments Bowtie reports. If you ask for 1 alignment (default), --best guarantees it's the best. If you ask for -k 4, --best guarantees they're the 4 best, reported in best-to-worst order. If you ask for -a, --best guarantees that you'll get all of them in best-to-worst order.

Thanks
Ben

Thank for quick reply.
More questions for this topic:
1. Is it possible some reads only have 2 hits even -k 4 and --best are specified?
2. How does Bowtie deal with several "equal" best alignments but -k 1 --best were specified? This is the question for how Bowtie deal with repetitive reads.

Thanks again.

Last edited by liu3zhen; 11-04-2009, 02:20 PM.
Leave a comment:
Ben Langmead replied

11-04-2009, 01:39 PM
Originally posted by ecabot View Post

1) Is it possible to obtain an alignment for a read pair where one read uniquely maps but the other doesn't? (my guess: no)

Definitely yes! That's exactly where paired-end sequencing pays off . If either read aligns uniquely, that alignment will be used as an anchor to look for the mate's alignment and, if it's found, that paired-end alignment will be reported.

Originally posted by ecabot View Post

2) Does the mismatch setting apply to both reads or are they taken together. In other words if 1 mismatch is specified, can both members of a pair each have 1-mismatch? (my guess: yes)

The mismatch setting applies to each read. So, yes, if -v 1 is specified, *both* mates are allowed to have a mismatch.

Hope that helps,
Ben
Leave a comment:
Ben Langmead replied

11-04-2009, 01:35 PM
Originally posted by liu3zhen View Post

But the manual said that if -k or -a >1 and --best are specified, only best alignments will be reported and they are appear in best-to-worst order, which means that the best alignments are not "equally best".

That's right; --best does not limit the number of alignments Bowtie reports. If you ask for 1 alignment (default), --best guarantees it's the best. If you ask for -k 4, --best guarantees they're the 4 best, reported in best-to-worst order. If you ask for -a, --best guarantees that you'll get all of them in best-to-worst order.

Thanks
Ben
Leave a comment:
ecabot replied

11-04-2009, 01:34 PM
are pairs considered separately wrt mismatches and uniquness with soap-like policy

I have a couple of questions about how Bowtie deals with mismatches in a paired end run. (Using -v 1 and -m 1). I have my guesses as to how things work, but I am hoping that someone knowlegeable (e.g. Ben) will ring-in with the correct information.

1) Is it possible to obtain an alignment for a read pair where one read uniquely maps but the other doesn't? (my guess: no)

2) Does the mismatch setting apply to both reads or are they taken together. In other words if 1 mismatch is specified, can both members of a pair each have 1-mismatch? (my guess: yes)
Leave a comment:
Ben Langmead replied

11-04-2009, 01:33 PM
Originally posted by liu3zhen View Post

A question for number of mismatches. I can not set up -v 4. (error: -v arg must be at most 3) Does that mean Bowtie at most allow 3 mismatches for whatever length of reads? Thanks.

Hi liu3zhen,

To allow more than 3 mismatches in the alignment, use the Maq-like options: -n/-l/-e instead of -v.

Thanks,
Ben
Leave a comment:
Ben Langmead replied

11-04-2009, 01:32 PM
Originally posted by ramouz87 View Post

The data quality is not optimal but i guess that having no alignment using paired end is not due to that fact and probably parameter should be tuned.
Any one could give me some insight about the optimal setting for the paired end alignment ?
Thanks in advance,
Best,
ramzi

Hi Ramzi,

The options you're looking for are almost certainly -I/-X and --ff/--fr/--rf. You need to have a reasonably good idea of the expected insert size and specify an appropriate range with -I/-X. You should also confirm that your paired-end protocol produces pairs in the fw/rev orientation. This is the typical configuration for Illumina. If your paired-end data has a different orientation, change it with --ff or --rf.

Hope that helps,
Ben
Leave a comment:
liu3zhen replied

11-04-2009, 01:28 PM
Another question:

I'm reading the manual for -k -a and --best.

I'm confusing about if we put (-k or -a) with --best together. I thought that if a read has several "best" alignments, these "best" should have kinds of "equal" alignment scores. But the manual said that if -k or -a >1 and --best are specified, only best alignments will be reported and they are appear in best-to-worst order, which means that the best alignments are not "equally best".

Hopefully get your help soon, thanks.
Leave a comment:
liu3zhen replied

11-04-2009, 12:58 PM
A question for number of mismatches. I can not set up -v 4. (error: -v arg must be at most 3) Does that mean Bowtie at most allow 3 mismatches for whatever length of reads? Thanks.
Leave a comment:
ramouz87 replied

11-03-2009, 07:15 AM
Hi,
I'm New in the field of NGS (was working mainly on microarray data analysis) and i'm starting to invastigate comon tools related to sequence analysis.
I have human data (paired reads/ 75 base) and used Bowtie for the alignment.
I used standard parameter for alignment :
bowtie -t -p 8 h_sapiens_37_asm ./s_8_1_sequence.fq ./s_8_1_sequence.fq.bowtie.align
bowtie -t -p 8 h_sapiens_37_asm ./s_8_2_sequence.fq ./s_8_2_sequence.fq.bowtie.align
bowtie -t -p 8 h_sapiens_37_asm -1 ./s_8_1_sequence.fq -2 ./s_8_2_sequence.fq ./s_8_sequence.fq.bowtie.align

and I get respectively the following results:
# reads processed: 6660511
# reads with at least one reported alignment: 4615451 (69.30%)
# reads that failed to align: 2045060 (30.70%)
# reads with at least one reported alignment: 5050548 (75.83%)
# reads that failed to align: 1609963 (24.17%)
# reads with at least one reported alignment: 13371 (0.20%)
# reads that failed to align: 6647140 (99.80%)

The data quality is not optimal but i guess that having no alignment using paired end is not due to that fact and probably parameter should be tuned.
Any one could give me some insight about the optimal setting for the paired end alignment ?
Thanks in advance,
Best,
ramzi
Leave a comment:
Ben Langmead replied

11-02-2009, 07:29 PM
Originally posted by Xi Wang View Post

I am still wondering whether the seed region is defined only for counting the mismatches or not.

Yes. The setting for -l matters for the -n limit but not for the -e limit.

Originally posted by Xi Wang View Post

If I want to just use the quality score criterion, and set -l equal to 0, does it work?

No, -l must be set to 5 or greater.

Ben
Leave a comment:

Previous 1 8 15 16 17 18 19 20 21 28 34 template Next

Recent Advances in Sequencing Analysis Tools

by seqadmin

The sequencing world is rapidly changing due to declining costs, enhanced accuracies, and the advent of newer, cutting-edge instruments. Equally important to these developments are improvements in sequencing analysis, a process that converts vast amounts of raw data into a comprehensible and meaningful form. This complex task requires expertise and the right analysis tools. In this article, we highlight the progress and innovation in sequencing analysis by reviewing several of the...
- Channel: Articles
05-06-2024, 07:48 AM
Essential Discoveries and Tools in Epitranscriptomics

by seqadmin

The field of epigenetics has traditionally concentrated more on DNA and how changes like methylation and phosphorylation of histones impact gene expression and regulation. However, our increased understanding of RNA modifications and their importance in cellular processes has led to a rise in epitranscriptomics research. “Epitranscriptomics brings together the concepts of epigenetics and gene expression,” explained Adrien Leger, PhD, Principal Research Scientist...
- Channel: Articles
04-22-2024, 07:01 AM

Topics	Statistics	Last Post
A Closer Look at the Enigmatic Genomes of Oikopleura dioica by seqadmin Started by seqadmin, Yesterday, 06:35 AM	0 responses 15 views 0 likes	Last Post by seqadmin Yesterday, 06:35 AM
Advanced Epigenome Editing Platform Explores Gene Regulation Mechanisms by seqadmin Started by seqadmin, 05-09-2024, 02:46 PM	0 responses 21 views 0 likes	Last Post by seqadmin 05-09-2024, 02:46 PM
Telomere Maintenance by PARP1: A New Perspective in Cancer Research by seqadmin Started by seqadmin, 05-07-2024, 06:57 AM	0 responses 18 views 0 likes	Last Post by seqadmin 05-07-2024, 06:57 AM
Enhanced Neoantigen Detection: Introducing NeoHunter by seqadmin Started by seqadmin, 05-06-2024, 07:17 AM	0 responses 19 views 0 likes	Last Post by seqadmin 05-06-2024, 07:17 AM

Seqanswers Leaderboard Ad

Announcement

Leave a comment:

Leave a comment:

Leave a comment:

Leave a comment:

Leave a comment:

Leave a comment:

Leave a comment:

Leave a comment:

Leave a comment:

Leave a comment:

Leave a comment:

Leave a comment:

Leave a comment:

Leave a comment:

Leave a comment:

Latest Articles

ad_right_rmr

News