Seqanswers Leaderboard Ad

**doxologist** · 03-01-2009, 04:58 AM

IMHO, I think method with Burrows-Wheeler transform are now the most optimal.

Bowtie is quick and easy...
bwa in the MAQ package is more flexible and extensive.

Hope this helps.

**doxologist** · 03-01-2009, 04:59 AM

oh.. by the way... the prev recommendations is for data with Illumina and 454. If you want to analyze Solid data, different software must be used for colorspace. Visit the solid thread for more info there.

**found** · 03-01-2009, 05:41 PM

Thanks l lot. it is Illumina data.
when you mentioned "Burrows-Wheeler transform", is it a tool, or an algorithm? or it is just the full name of "bwa in the MAQ package"?

**jkbonfield** · 03-03-2009, 01:20 AM

Burrows-Wheeler transform is an algorithm, originally used for data compression in tools like bzip. The interest for bioinformatics is that with more recent tweaks (something called "FM Index") it can be both a compression tool and also an indexing tool.

This indexing tool is what makes it good for short-read aligners, bwa and bowtie being two such examples. Whether this method is optimal for you depends on the size of the reference genome you're comparing against.

As for why there are so many tools, well it's harder than you'd think. Sure it's easy to simply align data, but to do it for read-pair data with short indels taking into account multiple matches and computing probabilities that the sequence has been misaligned (etc) all adds up to a complex task. This has lead to a lot of competition between groups. I expect in time the number will dwindle as we get winners and losers.

**doxologist** · 03-03-2009, 07:35 AM

I agree... there would eventually be a few dominant tools for each type of information space... which would take the best algorithms and be the most data-friendly. The initiatives for common format would make this process much more efficient.

I think the future added value would no longer just be alignment, but what's downstream. SNP detection, paired ends, indel detection, handling large numbers of samples, etc.

Topics	Statistics	Last Post
Expanding the Horizons of Cellular Research with the Single Cell Atlas by seqadmin Started by seqadmin, 04-25-2024, 11:49 AM	0 responses 19 views 0 likes	Last Post by seqadmin 04-25-2024, 11:49 AM
Genetic Variants and Diabetes Risk in Childhood Cancer Survivors by seqadmin Started by seqadmin, 04-24-2024, 08:47 AM	0 responses 20 views 0 likes	Last Post by seqadmin 04-24-2024, 08:47 AM
Cancer Metastasis: A Deep Dive into Cellular Plasticity by seqadmin Started by seqadmin, 04-11-2024, 12:08 PM	0 responses 62 views 0 likes	Last Post by seqadmin 04-11-2024, 12:08 PM
Proteogenomic Profiles Offer New Clues in Prostate Cancer by seqadmin Started by seqadmin, 04-10-2024, 10:19 PM	0 responses 60 views 0 likes	Last Post by seqadmin 04-10-2024, 10:19 PM

Seqanswers Leaderboard Ad

Announcement

So many software for alignment!!!

Comment

Comment

Comment

Comment

Comment

Latest Articles

ad_right_rmr

News