Unconfigured Ad

Collapse
X
 
  • Filter
  • Time
  • Show
Clear All
new posts
  • rboettcher
    Member
    • Oct 2010
    • 71

    Short sequence alignment

    Hi all,
    I am currently looking for a software to do alignments of short reads against other short sequences (so not the whole genome). The only problem is: I want to do a lot of these alignments. So instead of aligning Xmio reads to one big sequence, my goal is to align them against 10k (or so) very short sequences. Does anybody know a suitable software?

    Regards

    €: I should add that I'm not aiming on multiple sequence alignments.
    Last edited by rboettcher; 03-07-2011, 05:58 AM.
  • henry.wood
    Member
    • Apr 2010
    • 63

    #2
    We have done something like this using bwa. We made an artificial reference 'genome' from 3000 short (5-15Kb) fragments treating each fragment as a chromosome. We then aligned several lanes of Illumina data against this quite happily.

    Comment

    • rboettcher
      Member
      • Oct 2010
      • 71

      #3
      I would prefer something not relying on prior indexing, so a fast implementation of Needleman-Wunsch or Hirschberg-algorithm would be sufficient I suppose, as long as it is accessible by console.

      Comment

      • zee
        NGS specialist
        • Apr 2008
        • 249

        #4
        Novoalign does full Needleman-Wunsch after seeding matches. You can also map short reads to small fragments.

        Comment

        • sparks
          Senior Member
          • Mar 2008
          • 126

          #5
          Adding to Zees comment, Novoalign indexing step is very fast for small references so it shouldn't be an issue.

          I was also wondering if you're interested in alignments that overlap rather than are contained in your short target sequences. If you are then you could add N's to both ends of the short sequences that you index.

          Comment

          • rboettcher
            Member
            • Oct 2010
            • 71

            #6
            No overlaps, just aligning the sequences to a number of short reference sequences. Unfortunately, I couldn't get Novoalign to run on my system

            Comment

            • sparks
              Senior Member
              • Mar 2008
              • 126

              #7
              What problem did you have with Novoalign?

              Comment

              • ericgdp
                Junior Member
                • Mar 2010
                • 2

                #8
                May be you can use Abacas.

                Comment

                • biznatch
                  Senior Member
                  • Nov 2010
                  • 124

                  #9
                  PatternHunter?

                  Wang, Yuezhou, et al. “Pharmacological Targeting of Vacuolar H+-ATPase via Subunit V1G Combats Multidrug-Resistant Cancer.” Cell Chemical Biology, 2020, doi:10.1016/j.chembiol.2020.06.011. Abstract Multidrug resistance (MDR) in cancer remains a major challenge for the success of chemotherapy. Natural products have been a rich source for the discovery of drugs against MDR cancers. Here, we applied high-throughput cytotoxicity...

                  Comment

                  • rboettcher
                    Member
                    • Oct 2010
                    • 71

                    #10
                    Thank you all very much for your suggestions, but I guess none of the aligners mentioned is really meant to do what I intend it to do, as far as I can tell from studying the manuals/faqs.
                    So after some consideration, I decided to implement a simple NW-algorithm and after switching from Perl to Java the speed is acceptable.
                    I had hoped I could skip this step but it seems there is no real alternative.
                    Thanks anyway, I guess you will read about my next problems soon enough on the forums

                    Best Regards

                    Comment

                    Latest Articles

                    Collapse

                    • SEQadmin2
                      From Collection to Sequencing: Why Sample Preparation and Preservation Define Sequencing Data
                      by SEQadmin2


                      Data variability is still an issue in sequencing technologies despite the advances in reproducibility and accuracy of these platforms. But the problem does not originate in the sequencing itself, but in the previous steps, before the sample reaches the sequencer.


                      The first step is collection, followed by preservation and sample preparation for analysis. Most scientists overlook those steps, but not being careful might just be skewing the experiment’s results.
                      ...
                      Yesterday, 10:05 AM
                    • SEQadmin2
                      Single-Cell Sequencing at an Inflection Point: Early Impacts of New Platforms and Emerging Trends
                      by SEQadmin2


                      With the launch of new single-cell sequencing platforms in 2026, the field stands at an exciting inflection point. This article surveys the most impactful advances in the field and discusses how they’re reshaping research in cancer, immunology, and beyond.


                      Introduction

                      Single-cell sequencing technologies have undergone remarkable advances over the past decade, transitioning from low-throughput experimental approaches to highly scalable platforms capable of...
                      05-22-2026, 06:42 AM
                    • SEQadmin2
                      Environmental Genomics in the Age of NGS: From Microbes to Conservation Strategies
                      by SEQadmin2

                      Studying ecosystems means dealing with complex, multi-species communities that are hard to observe at scale. This complexity, however, hides many important questions to be answered, from how biogeochemical cycles work and how climate change can affect species distribution to how conservation strategies can work best.


                      Genomics, particularly since the expansion of NGS, has transformed ecosystem ecology. By sequencing environmental DNA, we can now assess biodiversity without direct...
                      05-06-2026, 09:04 AM

                    ad_right_rmr

                    Collapse

                    News

                    Collapse

                    Topics Statistics Last Post
                    Started by SEQadmin2, Yesterday, 12:03 PM
                    0 responses
                    19 views
                    0 reactions
                    Last Post SEQadmin2  
                    Started by SEQadmin2, Yesterday, 11:40 AM
                    0 responses
                    14 views
                    0 reactions
                    Last Post SEQadmin2  
                    Started by SEQadmin2, 05-28-2026, 11:40 AM
                    0 responses
                    29 views
                    0 reactions
                    Last Post SEQadmin2  
                    Started by SEQadmin2, 05-26-2026, 10:12 AM
                    0 responses
                    31 views
                    0 reactions
                    Last Post SEQadmin2  
                    Working...