Hello Ryan,
FYI, I made multiple improvements in Bowtie 0.9.8 that will help prevent such problems for people in the future. First of all, the Mac binary available from sourceforge is now "Universal" for i386 and x86_64, whereas it was previously i386 only, which forced 64-Mac users to start from source. Also, by default bowtie-build will now automatically look for values for the --bmax, --dcv and --packed parameters that fit into the memory of the computer it's running on. This obviates the tedious trial-and-error of trying larger and larger --bmaxdivn. Also, bowtie-build and bowtie-build-packed are now both "in" bowtie-build - they are no longer separate binaries. Packed mode is activated by passing -p/--packed to bowtie-build.
I hope these improvements help you and others.
Thanks,
Ben
FYI, I made multiple improvements in Bowtie 0.9.8 that will help prevent such problems for people in the future. First of all, the Mac binary available from sourceforge is now "Universal" for i386 and x86_64, whereas it was previously i386 only, which forced 64-Mac users to start from source. Also, by default bowtie-build will now automatically look for values for the --bmax, --dcv and --packed parameters that fit into the memory of the computer it's running on. This obviates the tedious trial-and-error of trying larger and larger --bmaxdivn. Also, bowtie-build and bowtie-build-packed are now both "in" bowtie-build - they are no longer separate binaries. Packed mode is activated by passing -p/--packed to bowtie-build.
I hope these improvements help you and others.
Thanks,
Ben
), there's a need to discount evidence from alignments whose placement was ambiguous. That's what field #7 does (or will do, when we've settled on what should actually go there; we've marked it as "reserved" for now). The good news is that we can reasonably assume deep coverage, which means that we don't have to rely on the value in field #7 for just one read to know how reliable its evidence is. We can, for example, take the max of the values for field #7 for all the reads that span the column. That's just an example; probably something smarter than max is needed. In short, I argue that A) the case where field #7 is very wrong is rare, and B) in cases where it is very wrong, it can probably correct it by looking at nearby aligned reads. So I think the problem is manageable.
Comment