Unconfigured Ad

Collapse
X
 
  • Filter
  • Time
  • Show
Clear All
new posts
  • found
    Junior Member
    • Feb 2009
    • 8

    So many software for alignment!!!

    Hi,

    I am new to the area of mRNA-Seq data analysis, actually also not "old" to the bioinformatics.
    Recently, I have read a paper involved in the mapping of mRNA-Seq data to the human genome by ELAND. I am very interested in the mapping result and want to do some data mining for my own purpose. And the author only gives out the original mRNA-Seq data.
    I have browsed this forum for days and learned a lot. first thing is that ELAND is not available for me. I found that someone have listed so many software for short read sequence alignment and I feel really confused about that which one is suitable for me. That's why I am here to get some suggestions from experienced expert.

    My object is to map all of these mRNA-Seq to a database which includes the human genome and my own set of sequences(56bp). At most two mismatches and no gaps are allowed. Since a small part of sequences my contain symbol '.' such as "ATCTAT.CGTACG.GCTAGTGGTGAAGG", the software should not filter '.' out and consider it as a mismatch during alignment.

    Can you recommend a software to implement this purpose. Of course, both accuracy and speed are important. I think accuracy is more important if they can not be satisfied simultaneously.
    Last edited by found; 02-27-2009, 02:03 PM.
  • doxologist
    Member
    • Jan 2009
    • 96

    #2
    IMHO, I think method with Burrows-Wheeler transform are now the most optimal.

    Bowtie is quick and easy...
    bwa in the MAQ package is more flexible and extensive.

    Hope this helps.

    Comment

    • doxologist
      Member
      • Jan 2009
      • 96

      #3
      oh.. by the way... the prev recommendations is for data with Illumina and 454. If you want to analyze Solid data, different software must be used for colorspace. Visit the solid thread for more info there.

      Comment

      • found
        Junior Member
        • Feb 2009
        • 8

        #4
        Thanks l lot. it is Illumina data.
        when you mentioned "Burrows-Wheeler transform", is it a tool, or an algorithm? or it is just the full name of "bwa in the MAQ package"?
        Last edited by found; 03-01-2009, 05:45 PM.

        Comment

        • jkbonfield
          Senior Member
          • Jul 2008
          • 146

          #5
          Burrows-Wheeler transform is an algorithm, originally used for data compression in tools like bzip. The interest for bioinformatics is that with more recent tweaks (something called "FM Index") it can be both a compression tool and also an indexing tool.

          This indexing tool is what makes it good for short-read aligners, bwa and bowtie being two such examples. Whether this method is optimal for you depends on the size of the reference genome you're comparing against.

          As for why there are so many tools, well it's harder than you'd think. Sure it's easy to simply align data, but to do it for read-pair data with short indels taking into account multiple matches and computing probabilities that the sequence has been misaligned (etc) all adds up to a complex task. This has lead to a lot of competition between groups. I expect in time the number will dwindle as we get winners and losers.

          Comment

          • doxologist
            Member
            • Jan 2009
            • 96

            #6
            I agree... there would eventually be a few dominant tools for each type of information space... which would take the best algorithms and be the most data-friendly. The initiatives for common format would make this process much more efficient.

            I think the future added value would no longer just be alignment, but what's downstream. SNP detection, paired ends, indel detection, handling large numbers of samples, etc.

            Comment

            Latest Articles

            Collapse

            • SEQadmin2
              From Collection to Sequencing: Why Sample Preparation and Preservation Define Sequencing Data
              by SEQadmin2


              Data variability is still an issue in sequencing technologies despite the advances in reproducibility and accuracy of these platforms. But the problem does not originate in the sequencing itself, but in the previous steps, before the sample reaches the sequencer.


              The first step is collection, followed by preservation and sample preparation for analysis. Most scientists overlook those steps, but not being careful might just be skewing the experiment’s results.
              ...
              06-02-2026, 10:05 AM
            • SEQadmin2
              Single-Cell Sequencing at an Inflection Point: Early Impacts of New Platforms and Emerging Trends
              by SEQadmin2


              With the launch of new single-cell sequencing platforms in 2026, the field stands at an exciting inflection point. This article surveys the most impactful advances in the field and discusses how they’re reshaping research in cancer, immunology, and beyond.


              Introduction

              Single-cell sequencing technologies have undergone remarkable advances over the past decade, transitioning from low-throughput experimental approaches to highly scalable platforms capable of...
              05-22-2026, 06:42 AM
            • SEQadmin2
              Environmental Genomics in the Age of NGS: From Microbes to Conservation Strategies
              by SEQadmin2

              Studying ecosystems means dealing with complex, multi-species communities that are hard to observe at scale. This complexity, however, hides many important questions to be answered, from how biogeochemical cycles work and how climate change can affect species distribution to how conservation strategies can work best.


              Genomics, particularly since the expansion of NGS, has transformed ecosystem ecology. By sequencing environmental DNA, we can now assess biodiversity without direct...
              05-06-2026, 09:04 AM

            ad_right_rmr

            Collapse

            News

            Collapse

            Topics Statistics Last Post
            Started by SEQadmin2, 06-02-2026, 12:03 PM
            0 responses
            19 views
            0 reactions
            Last Post SEQadmin2  
            Started by SEQadmin2, 06-02-2026, 11:40 AM
            0 responses
            14 views
            0 reactions
            Last Post SEQadmin2  
            Started by SEQadmin2, 05-28-2026, 11:40 AM
            0 responses
            29 views
            0 reactions
            Last Post SEQadmin2  
            Started by SEQadmin2, 05-26-2026, 10:12 AM
            0 responses
            31 views
            0 reactions
            Last Post SEQadmin2  
            Working...