**ieuanclay** · 03-10-2009, 08:35 AM

Yes - great thank you!

Ieuan

**ieuanclay** · 03-10-2009, 10:48 AM

Sorry to keep on about this, I just want to get it clear.

By default:
-k is 1, so only one alignment (the best according to -n/-v/-l/-e) is reported.
-m is unlimited

So if a read has multiple valid alignments, one of which is better than the others (fewer mismatches, though the others are still valid), and I specify -k 1 -m 1, will the best alignment be given, or will the read be pumped into --maxfa?

If I am worried about this sort of situation, should I specify --best?

**Ben Langmead** · 03-10-2009, 11:01 AM

> So if a read has multiple valid alignments, one of which is better than the others (fewer mismatches, though the others are still valid), and I specify -k 1 -m 1, will the best alignment be given, or will the read be pumped into --maxfa?

In that situation, no alignments will be printed and the read will go into the --maxfa/--maxfq file(s).

> If I am worried about this sort of situation, should I specify --best?

That won't help in this case because --best doesn't change which alignments are considered valid; rather, it changes which valid alignments are reported by Bowtie. The -v/-n/-l/-e options are the only ones that change which alignments are considered valid by Bowtie. If the set of valid alignments happens to be stratified (e.g., there's an exact hit and a bunch of 1-mismatch hits), the existence of the better alignments doesn't invalidate the worse ones.
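
To make the distinction concrete, here is a minimal Python sketch of the reporting logic as described in this post. It is only a model of the behaviour discussed here, not Bowtie's actual code; the function name `report_alignments` and its arguments are hypothetical.

```python
def report_alignments(valid_mismatch_counts, k=1, m=None, best=False):
    """Model of which valid alignments get reported for one read.

    valid_mismatch_counts: mismatch count of each *valid* alignment,
    in the order they happened to be found (hypothetical input).
    Returns (reported, suppressed), where suppressed=True means the
    read would go to the --maxfa/--maxfq file(s).
    """
    # -m looks at ALL valid alignments; a better hit does not
    # invalidate the worse ones, so it cannot rescue the read.
    if m is not None and len(valid_mismatch_counts) > m:
        return [], True
    # --best only changes which valid alignments are reported.
    if best:
        candidates = sorted(valid_mismatch_counts)
    else:
        candidates = list(valid_mismatch_counts)
    return candidates[:k], False
```

With one exact hit and two 1-mismatch hits, `-k 1 -m 1 --best` still reports nothing in this model, whereas `-k 1 --best` without `-m` reports the exact hit.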

If this poses a problem, I'd be interested to hear more about what you're looking for...

Thanks,
Ben

**ieuanclay** · 03-11-2009, 05:33 AM

Hi Ben,

Thanks for your help so far - I am relatively new to mapping (but not so new that I am not impressed by bowtie!), so please excuse any dopey questions...

My worry is that I will lose alignments which have perfect alignments if I also have other near matches. I guess I can make -n smaller (1 say). But I suppose the caveat is that if there is an exact match and a near match, you cannot say which is the correct one - sequencing error etc... - and so it is conservative to reject the read as having multiple alignments?

For what I work on, it is really important to be very sure about where the reads map... so maybe it would be good to keep -n at 1 and be more confident about the reads? I don't want to have to refer back to alignment confidences in analyses later on; I would rather say that beyond a certain confidence threshold I am happy with them all. If I am going to reduce -n, should I also reduce -l to 20 or 25?

I guess what I really want to know is: how stringent are the default alignment settings? Can I make them more stringent without losing a lot of 'true' (but imperfect) alignments?

Thanks again,

Ieuan

**Ben Langmead** · 03-11-2009, 06:02 AM

> My worry is that I will lose alignments which have perfect alignments if I also have other near matches. I guess I can make -n smaller (1 say). But I suppose the caveat is that if there is an exact match and a near match, you cannot say which is the correct one - sequencing error etc... - and so it is conservative to reject the read as having multiple alignments?

I see your concern. In your case, you may want to consider running Bowtie with -a --nostrata (or -k <some int> --nostrata) and then postprocessing the results in whatever way you think is appropriate for your application. If you'd like to reject reads on the basis of the number of alignments found in the *best* match stratum (as opposed to all strata), you can do that with a script.
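
A postprocessing script of that kind might look like the sketch below. It assumes Bowtie's tab-separated default output with the read name in column 1 and a comma-separated list of mismatch descriptors in column 8 (empty for an exact match); please check that against the output of your version. It also uses the total number of mismatches as a stand-in for the stratum, which may not match how Bowtie defines strata in -n mode. The function name is hypothetical.

```python
from collections import defaultdict


def unique_in_best_stratum(lines):
    """Return {read_name: line} for reads with exactly one best-stratum hit."""
    by_read = defaultdict(list)
    for line in lines:
        fields = line.rstrip("\n").split("\t")
        if len(fields) < 7:
            continue  # skip lines that do not look like alignment records
        # Assumed layout: column 8 (index 7) lists mismatches, comma-separated;
        # it is empty or absent for an exact match.
        descriptors = fields[7] if len(fields) > 7 else ""
        n_mismatches = len(descriptors.split(",")) if descriptors else 0
        by_read[fields[0]].append((n_mismatches, line))

    kept = {}
    for name, alignments in by_read.items():
        best = min(n for n, _ in alignments)
        top = [line for n, line in alignments if n == best]
        if len(top) == 1:  # unambiguous within the best stratum
            kept[name] = top[0]
    return kept
```

Run on the output of an `-a --nostrata` run, this keeps a read with one exact hit plus several 1-mismatch hits, and drops a read with two exact hits.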

Another alternative is to do multiple Bowtie runs with decreasingly stringent alignment policies (e.g. -n 0, then -n 1, etc.). The input to each run might be the --unfq reads from the run before.
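
As a rough sketch of that multi-pass idea, the Python below only builds the command lines; it does not run Bowtie. The index name, read file and `pass<N>` output names are hypothetical, and it assumes FASTQ input (`-q`) and the `bowtie [options] <ebwt> <reads> [<hits>]` argument order.

```python
def build_passes(index, reads, max_n=2):
    """Build one bowtie command per pass, from -n 0 up to -n max_n.

    Each pass reads the unaligned reads (--unfq) of the previous pass.
    File names are placeholders chosen for this sketch.
    """
    commands = []
    current_input = reads
    for n in range(max_n + 1):
        unaligned = f"pass{n}_unaligned.fq"
        commands.append([
            "bowtie", "-q", "-n", str(n),
            "--unfq", unaligned,        # reads that failed to align this pass
            index, current_input, f"pass{n}.map",
        ])
        current_input = unaligned       # next pass only sees the leftovers
    return commands
```

Each returned list could then be handed to `subprocess.run(cmd, check=True)` in order, so that a read is assigned at the most stringent policy under which it aligns.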

> I guess what I really want to know is: how stringent are the default alignment settings? Can I make them more stringent without losing a lot of 'true' (but imperfect) alignments?

The default alignment policy is -n 2 -l 28 -e 70, which mimics Maq's defaults (with the caveat that Maq actually lets through some alignments with 3 mismatches in the seed). Whether you can make the policy more stringent without losing true alignments depends on how different your query organism is from the reference. Intuitively, the default policy has no problem finding alignments where there are 2 SNPs very close together, but might have a problem finding alignments where there are 3 SNPs very close together. The same goes for -n 1 and 1 SNP vs. 2 SNPs. It's up to you to determine how well those policies fit your problem.
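
For intuition, the -n/-l/-e policy can be modelled roughly as follows: at most `n` mismatches within the first `l` bases (the seed), and the summed quality of all mismatched positions no greater than `e`. This is a simplified sketch with a hypothetical function name; it does not model any quality rounding Bowtie may apply, nor the Maq quirk mentioned above.

```python
def is_valid_n_mode(mismatches, n=2, l=28, e=70):
    """Simplified check of the -n/-l/-e alignment policy.

    mismatches: list of (offset_from_5prime_end, phred_quality) pairs,
    one per mismatched read position (hypothetical representation).
    """
    # Mismatches falling inside the seed (first l bases of the read).
    seed_mismatches = sum(1 for pos, _ in mismatches if pos < l)
    # -e limits the summed quality over ALL mismatched positions.
    quality_sum = sum(qual for _, qual in mismatches)
    return seed_mismatches <= n and quality_sum <= e
```

Under the defaults, two nearby SNPs at quality 30 pass (`[(3, 30), (5, 30)]`), while three SNPs inside the seed fail on the `n` limit regardless of their qualities.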

Hope that helps,
Ben

**What_Da_Seq** · 03-11-2009, 07:57 AM

How does Bowtie handle ambiguous bases in the reference genome?

Does anybody have experience preparing a Bowtie search index where certain bases have been replaced with ambiguity codes such as "Y", which stands for "C" or "T"? If so, will these locations be called matches or mismatches if the Solexa read being aligned has either a "C" or a "T" at that position?

Thanks

**Ben Langmead** · 03-11-2009, 08:10 AM

The Bowtie indexing step elides stretches of ambiguous bases in the reference. As a result, alignments that overlap an ambiguous base in the reference are never considered "valid" by Bowtie and will not be reported.

This is explained in a couple of paragraphs in the manual that are new as of 0.9.9.1:

> A result of Bowtie's indexing strategy is that alignments involving one or more ambiguous reference characters (N, -, R, Y, etc.) are considered invalid by Bowtie, regardless of the alignment policy. This is true only for ambiguous characters in the reference; alignments involving ambiguous characters in the read are legal, subject to the alignment policy.
>
> Also, alignments that "fall off" the reference sequence are not considered legal by Bowtie, though some such alignments will become legal once gapped alignment is implemented.
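
If you want to see which positions in your modified reference can still receive alignments, a small check along these lines may help. It is a sketch of the rule quoted above rather than anything Bowtie exposes; the function name is hypothetical, and it treats every character other than A, C, G and T as ambiguous.

```python
UNAMBIGUOUS = set("ACGT")


def window_is_alignable(reference, offset, read_length):
    """True if a read of read_length placed at offset could be a valid hit.

    Mirrors the two rules quoted above: the window must lie entirely
    within the reference, and must contain no ambiguous characters.
    """
    if offset < 0 or offset + read_length > len(reference):
        return False  # the alignment would "fall off" the reference
    window = reference[offset:offset + read_length].upper()
    return all(base in UNAMBIGUOUS for base in window)
```

For a reference such as `ACGTYACGTACGT`, any window overlapping the `Y` is rejected, whether the read has a `C` or a `T` at that position.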

**What_Da_Seq** · 03-11-2009, 12:39 PM

Thanks Ben. I could not identify an option for "bowtie-build" that is geared towards maximizing the number of reads aligned by Bowtie (rather than towards speed or memory efficiency).
Your help is appreciated.

Thanks

**Ben Langmead** · 03-11-2009, 12:46 PM

Yes, all bowtie-build options are identical in terms of the index's ability to generate alignments (except those that have slight, non-specific effects, like --ntoa or --oldpmap).

**tniranj1** · 03-11-2009, 10:48 PM

Help with installation

I'm new to next-gen sequencing and have started playing around with different alignment tools for data that will soon be coming in to my lab. From what I've heard, Bowtie sounds perfect, and I appreciate the speedy feedback that's been made available to the community.
I do have a slight installation problem. I get the following error during "Make".

SeqAn-1.1/seqan/basic/basic_generated_forwards.h:507: error: parse error before numeric constant
SeqAn-1.1/seqan/basic/basic_generated_forwards.h:761: confused by earlier errors, bailing out
make: *** [bowtie-build] Error 1

I installed the platform-independent version on my Mac (OS 10.3.9... yes, it's old I know, we're upgrading soon). Appreciate any help with resolving this.

-TiN

**Ben Langmead** · 03-12-2009, 04:30 AM

What version of g++ do you have (try 'g++ -v') and what version of Bowtie are you trying to compile? Is there another g++ version installed besides the default? I'm not familiar with 10.3, but you can try running g++3 and g++4 and see if either of those work.

Thanks,
Ben

**tniranj1** · 03-12-2009, 09:07 AM

I'm using gcc version 3.3 with bowtie 0.9.9.1. Do I need version 4 or higher for g++ in order for installation of bowtie to work, or is 3 sufficient?
Thanks,
-TiN

**Ben Langmead** · 03-12-2009, 09:18 AM

Well, the oldest g++ I've used is 3.4.6, which works without warnings. I just tried 3.2.3 and got a bunch of warnings and errors; mostly in the SeqAn headers. So, yes, if you happen to have a newer g++ version somewhere on your machine then please try that. E.g., try typing g++ then hitting tab to see if there's something called g++4 or g++34 or similar. If there is something called g++34, for example, then make bowtie using 'make GCC_SUFFIX=34'. Let me know if that doesn't work; I can try to fix this in a future version of Bowtie.

Thanks,
Ben

**tniranj1** · 03-12-2009, 05:17 PM

I just installed gcc 3.4.6 and changed the etc/profile $PATH to reflect the update. When I ran make again, significantly more SeqAn-1.1 errors popped up (too much to post). There is no suffix to the new g++ file. Should I shoot for gcc4.x or would it be more appropriate to wait until our Leopard computer comes in... I would prefer to start testing with this computer now, though.
Really appreciate the help!
-TiN

**Ben Langmead** · 03-12-2009, 05:23 PM

Darn! Sorry to waste your time.

I can testify that the gcc4 and gcc346 versions on Leopard (from the developer's tools) work fine for me, as do the various gcc4 versions I've tried on Linux. I'm sorry that 3.4.6 doesn't seem to be working under 10.3. I will add it to my TODO list to address some of that problematic SeqAn code before the next release. In the meantime, since 3.4.6 didn't work, waiting for your Leopard computer is the option that has the least chance of wasting more of your time.

Ben
