Seqanswers Leaderboard Ad

Collapse

Announcement

Collapse
No announcement yet.
X
 
  • Filter
  • Time
  • Show
Clear All
new posts

  • ask for gapped (indel) alignment software

    i have now single-end solexa data and later will be also working on PE.

    i've tried soap2 for the -g 5 option for gapped alignment, however, on known indels, there's nothing properly mapped. although i've seen on seqanswers someone did gapped alignment successfully (but on soap rather than soap2?). and the developer says soap2 can do gapped PET alignment, but "gapped" means the interval between the ends rather than indels?

    bowtie says it doesn't support gapped alignment yet.

    may anyone tell me which free software can do gapped alignment (for indels) on both SE and PE data? many thx!
    Last edited by polyhedron; 10-14-2010, 09:09 AM.

  • #2
    I may say that BWA accepts indel...
    Francois Sabot, PhD

    Be realistic. Demand the Impossible.
    www.wikiposon.org

    Comment


    • #3
      Novoalign, BWA, BFAST

      Oh, Novoalign is not exactly free. You can get a free trial from them, though, and it is not very expensive.
      Last edited by Michael.James.Clark; 10-14-2010, 01:18 AM.
      Mendelian Disorder: A blogshare of random useful information for general public consumption. [Blog]
      Breakway: A Program to Identify Structural Variations in Genomic Data [Website] [Forum Post]
      Projects: U87MG whole genome sequence [Website] [Paper]

      Comment


      • #4
        Thanks very much, François and Michael!

        I've been testing bwa these days, and find bwa have considerable better mappability than soap2, and good with indels. The trade-off is that bwa is 3 or 4 times slower than soap2, and I haven't find so flexible output options as in soap2, like

        -r INT How to report repeat hits, 0=none; 1=random one; 2=all, [1]
        Therefore, I think when I've got really huge data, I would rather run soap2 first to get some basic idea and results from my reads, then just let bwa running for days to get more precise results.
        Last edited by polyhedron; 10-20-2010, 06:44 AM.

        Comment


        • #5
          SHRiMP is very robust to INDELs, but is very slow (as it performs Smith-Waterman alignments).

          Comment


          • #6
            -r INT How to report repeat hits, 0=none; 1=random one; 2=all, [1]
            you can try options -n and -N in sampe or samse subrouting!

            Comment


            • #7
              Originally posted by polyhedron View Post
              Therefore, I think when I've got really huge data, I would rather run soap2 first to get some basic idea and results from my reads, then just let bwa running for days to get more precise results.
              Days? BWA is very fast and accurate. How much data are you trying to align? On what environment? A lane of hiseq (75bp single ended) should take less than 1 day on a typical 8 core machine. You can further reduce the running times by splitting and computing the data into multiple machines.
              -drd

              Comment


              • #8
                Yeah, I'm with drio on this one. Something is wrong with your approach using BWA if it's taking days, even for a very large data set. You may not be running it in an optimal fashion for your computational resources if that's the case.

                All three gapped aligners that I mentioned--Novoalign, BFAST, BWA--are quite fast with Illumina reads, so if it seems slow, there may be an incorrect setting or it may be getting run in a suboptimal way.
                Mendelian Disorder: A blogshare of random useful information for general public consumption. [Blog]
                Breakway: A Program to Identify Structural Variations in Genomic Data [Website] [Forum Post]
                Projects: U87MG whole genome sequence [Website] [Paper]

                Comment


                • #9
                  Depending on the size of your reference sequence and amount of sequence data you could test if my program R2R is your solution.
                  Find it at: http://milne.ruc.dk/R2R/

                  Comment


                  • #10
                    there is a new mapper available called STAMPY (http://www.well.ox.ac.uk/project-stampy). the paper seems promising but I haven't tested it myself.

                    Comment


                    • #11
                      The best tools I've used for alignment of small indels are Stampy and Shrimp2. We've validated quite a number of these.
                      Unfortunately they are slow as well.

                      As a follow-up I've been using Dindel on Stampy aligned data for further testing. No wet-lab data on this yet though.

                      Comment


                      • #12
                        Novoalign and BWA work quite well for this problem. BWA is faster than the free version of Novoalign (not support MPI), but Novoalign maybe more accurate.
                        It might be worth taking a look at these surveys - though they are not the most up to date:


                        Comment

                        Latest Articles

                        Collapse

                        • seqadmin
                          Strategies for Sequencing Challenging Samples
                          by seqadmin


                          Despite advancements in sequencing platforms and related sample preparation technologies, certain sample types continue to present significant challenges that can compromise sequencing results. Pedro Echave, Senior Manager of the Global Business Segment at Revvity, explained that the success of a sequencing experiment ultimately depends on the amount and integrity of the nucleic acid template (RNA or DNA) obtained from a sample. “The better the quality of the nucleic acid isolated...
                          03-22-2024, 06:39 AM
                        • seqadmin
                          Techniques and Challenges in Conservation Genomics
                          by seqadmin



                          The field of conservation genomics centers on applying genomics technologies in support of conservation efforts and the preservation of biodiversity. This article features interviews with two researchers who showcase their innovative work and highlight the current state and future of conservation genomics.

                          Avian Conservation
                          Matthew DeSaix, a recent doctoral graduate from Kristen Ruegg’s lab at The University of Colorado, shared that most of his research...
                          03-08-2024, 10:41 AM

                        ad_right_rmr

                        Collapse

                        News

                        Collapse

                        Topics Statistics Last Post
                        Started by seqadmin, 03-27-2024, 06:37 PM
                        0 responses
                        13 views
                        0 likes
                        Last Post seqadmin  
                        Started by seqadmin, 03-27-2024, 06:07 PM
                        0 responses
                        11 views
                        0 likes
                        Last Post seqadmin  
                        Started by seqadmin, 03-22-2024, 10:03 AM
                        0 responses
                        53 views
                        0 likes
                        Last Post seqadmin  
                        Started by seqadmin, 03-21-2024, 07:32 AM
                        0 responses
                        69 views
                        0 likes
                        Last Post seqadmin  
                        Working...
                        X