Seqanswers Leaderboard Ad

Collapse

Announcement

Collapse
No announcement yet.
X
 
  • Filter
  • Time
  • Show
Clear All
new posts

  • Aligners, aligners, aligners

    Hi all,
    I'm working with Illumina GAII and it's has been clear for months that eland is no more the fastest (nor the best) aligner available (and I wonder why Illumina doesn't include an alternative tool...).
    BTW, there are many aligners out there and I'm puzzled on which one should I use. I've read the paper by Palimeri & Schlötterer which is only telling a part of the whole story (some aligners are not mentioned, options for tested aligners may not be optimal...). In the end I've been focusing mainly on bowtie and bwa, mainly for speed, then for SAM compliance (and I'm only assuming SAM will be a standard for all aligners someday).
    I know there are many developers reading this forum, and probably they will tell that their software is better... what about the others? What would you choose and why? Also, who has played with all the CLI options available to see how results vary?
    d

  • #2
    I would recommend maq, bwa, bowtie, novoalign and soap2. So far as I know, all these are under heavy daily uses by people from different centers. You might also find the following two pages (here and here) interesting. They are quite lengthy and one page is less related to the topic, though.
    Last edited by lh3; 10-16-2009, 10:30 AM.

    Comment


    • #3
      I second lh3's list - the five tools mentioned produce results that are in broad agreement with each other, despite using radically different algorithms. Each has different strengths and weaknesses, but provided that you pick one that is compatible for your data (for example Bowtie doesn't support Colorspace yet), you should get good results for your downstream analysis.

      Comment


      • #4
        I would recommend MAQ, BFAST and BWA. I recommend MAQ because it was the early standard and a compromise between ease-of-use and power. I recommend BFAST because I am the author as well as many other documented reasons. I recommend BWA because it is fast, good for low-error data, as well as small (<3bp) indels.

        Note: I dislike against any aligner that specifically ignores insertions and deletions (not just the small ones), which are quickly becoming an important mutation mechanism in cancer (for example). This makes me biased against aligners that do not use a Smith-Waterman local alignment to detect errors/variants (especially for ABI color space).

        Specify up front what you want to find (if only perfect matching reads with no variants, that's fine), then choose your aligner. Of course, aligners tuned to find only reads with up to one error will be faster and easier to use (so use them in those instances)!

        Comment


        • #5
          The 'robust' list posted above is a good starting point, with another vote for Novoalign. Completely depends on what your aims are -- we've added Mosaik to the list because we needed support for IUPAC codes, but also started with some of the algorithms that come prepackaged with EC2/Cloud scripts just for convenience.

          Comment


          • #6
            novoalign and bfast appear to be quite similar to me. nils, care to comment on the differences?

            Comment


            • #7
              Originally posted by dawe View Post
              Hi all,
              BTW, there are many aligners out there and I'm puzzled on which one should I use.
              The list given above is a good start for aligners.

              Depending on your needs, you might also want to look into assemblers. E.g.: using Solexas to polish a Sanger or 454 assembly. Programs that map Solexas to existing assemblies ideally dump the result in a format you can then directly use in finishing programs to manually adjust the last problems remaining.

              B.

              Comment

              Latest Articles

              Collapse

              • seqadmin
                Strategies for Sequencing Challenging Samples
                by seqadmin


                Despite advancements in sequencing platforms and related sample preparation technologies, certain sample types continue to present significant challenges that can compromise sequencing results. Pedro Echave, Senior Manager of the Global Business Segment at Revvity, explained that the success of a sequencing experiment ultimately depends on the amount and integrity of the nucleic acid template (RNA or DNA) obtained from a sample. “The better the quality of the nucleic acid isolated...
                03-22-2024, 06:39 AM
              • seqadmin
                Techniques and Challenges in Conservation Genomics
                by seqadmin



                The field of conservation genomics centers on applying genomics technologies in support of conservation efforts and the preservation of biodiversity. This article features interviews with two researchers who showcase their innovative work and highlight the current state and future of conservation genomics.

                Avian Conservation
                Matthew DeSaix, a recent doctoral graduate from Kristen Ruegg’s lab at The University of Colorado, shared that most of his research...
                03-08-2024, 10:41 AM

              ad_right_rmr

              Collapse

              News

              Collapse

              Topics Statistics Last Post
              Started by seqadmin, Yesterday, 06:37 PM
              0 responses
              10 views
              0 likes
              Last Post seqadmin  
              Started by seqadmin, Yesterday, 06:07 PM
              0 responses
              9 views
              0 likes
              Last Post seqadmin  
              Started by seqadmin, 03-22-2024, 10:03 AM
              0 responses
              50 views
              0 likes
              Last Post seqadmin  
              Started by seqadmin, 03-21-2024, 07:32 AM
              0 responses
              67 views
              0 likes
              Last Post seqadmin  
              Working...
              X