Unconfigured Ad

Collapse
X
 
  • Filter
  • Time
  • Show
Clear All
new posts
  • ypsd
    Junior Member
    • Mar 2010
    • 1

    How good bowtie can map SOLiD data?

    I tried the following command to map the attached fq file (single end, ChIP-seq).

    bowtie -C -p $threads -n 3 -e 160 -a -m 1 --best --strata $BOWTIE2_INDEX/hg19_c s_6.fq -S s_6n.sam 2>&1 | tee s_6n.log

    The mappable reads are of only 35%.

    # reads processed: 2500
    # reads with at least one reported alignment: 885 (35.40%)
    # reads that failed to align: 1376 (55.04%)
    # reads with alignments suppressed due to -m: 239 (9.56%)
    Reported 885 alignments to 1 output stream(s)

    The following command gives 38% (as -e is large)

    bowtie -p 8 -C -m 10 -l 27 -n 3 -e 950 -k 1 $BOWTIE_INDEX/hg19_c s_6.fq -S s_6.sam 2>&1 | tee s_6.log
    # reads processed: 2500
    # reads with at least one reported alignment: 972 (38.88%)
    # reads that failed to align: 1275 (51.00%)
    # reads with alignments suppressed due to -m: 253 (10.12%)
    Reported 972 alignments to 1 output stream(s)

    But supposedly, SOLiD pipeline should be able map this file for over 50%. Does anybody know how good bowtie is to map SOLiD files? Are there any other parameters that can improve the results?

    I see that people said, bfast is better than bowtie for default parameters. By tuning the parameter, can the results of bowtie be improved? Thanks!

    Attached Files
  • Jean
    Member
    • Nov 2008
    • 37

    #2
    We've mapped SOLiD RNAseq data with several mappers (Bowtie, BWA, bfast) and never got above 25-35% mapped. Others we have spoken to are getting similar results (or worse).

    Would be curious to know if anyone has mapped better.

    Comment

    • iamli
      Junior Member
      • Feb 2012
      • 1

      #3
      we mapped SOLiD with low mapped rate too!

      we mapped SOLiD with low mapped rate too!
      I used tophat to analyse it ! They are colorspace pair end reads.
      Originally posted by Jean View Post
      We've mapped SOLiD RNAseq data with several mappers (Bowtie, BWA, bfast) and never got above 25-35% mapped. Others we have spoken to are getting similar results (or worse).

      Would be curious to know if anyone has mapped better.

      Comment

      • kopi-o
        Senior Member
        • Feb 2008
        • 319

        #4
        I have mapped 70-80% with BioScope for some SOLiD RNA-seq data (paired or not). But in many cases also much less. With Bowtie maybe 50% in the best cases.

        Comment

        • pmiguel
          Senior Member
          • Aug 2008
          • 2328

          #5
          I was talking to someone the other day who said they got >50% mapping with Novoalign. If I recall correctly. Actually they seemed to read SeqAnswers, so maybe they would chime in?

          --
          Phillip

          Comment

          • morellr
            Junior Member
            • Mar 2010
            • 5

            #6
            I have 3 RNA-seq libraries generated on Solid5500. 64%-72% of the reads mapped with Lifescope.

            Comment

            • brentp
              Member
              • Apr 2010
              • 72

              #7
              I got best results with -v1 -m1 with bowtie on colorspace.

              run bowtie then bfast on colorspace reads. Contribute to brentp/bowfast development by creating an account on GitHub.

              Comment

              • gringer
                David Eccles (gringer)
                • May 2011
                • 845

                #8
                We've been getting 30-50% mapping for SOLiD4 reads using Bowtie. It has been as low as ~9% for a very low-read sample (~400k reads multiplexed in with other samples totalling ~400M reads).

                Comment

                • hajime
                  Member
                  • Mar 2011
                  • 14

                  #9
                  Based on my mapping results,the same RNA-seq sample mapped by tophat and lifescope got ~50% and ~80% mappable reads, respectively, even we have used the additional ECC primer.

                  I guess the Refcor module in lifescope may improve the mappable rate.
                  Yi John Huang (PhD student)
                  886-3-2118800 ext. 3731
                  Graduate Institute of Biomedical Science, Chang Gung University

                  Comment

                  • gringer
                    David Eccles (gringer)
                    • May 2011
                    • 845

                    #10
                    sure, the process used by Lifescope (which includes end clipping) increases the mappable reads. That doesn't necessarily mean it's a better result -- I can increase the sensitivity of a clinical test to 100% by always reporting a positive result.

                    Comment

                    • colindaven
                      Senior Member
                      • Oct 2008
                      • 417

                      #11
                      For human exome reads we generally get the following figures:

                      Bowtie ~35%
                      BWA 0.5.x ~40%
                      Clcbio ~50-60%
                      NovoalignCS ~55-65% *
                      Lifescope ~70-80% *



                      *use end clipping.
                      Iteratively trimmed reads are still very useful, provided you don't make them too short.

                      Bowtie1 is not that useful for long reads, as it only allows max 3 mismatches in the seed. Apparently more mismatches are necessary for useful mapping of Solid data as said by someone about using bwa successfully.

                      As discussed here before, % alignments isn't really the best quality criterion for an aligner. We prefer NovoalignCS to date based on clean alignments and SNPs called versus the others via a standard pipeline.

                      Comment

                      • carmeyeii
                        Senior Member
                        • Mar 2011
                        • 137

                        #12
                        So sad! I have been tweaking the parameters on Bowtie to get some huge runs to map decently, but so far the max I've gotten is 3% using -l 17 -n 3 !!!! Do you think it might be better to use -v ?

                        Seeded quality full-index search: 16:45:26
                        # reads processed: 60132812
                        # reads with at least one reported alignment: 1882083 (3.13%)
                        # reads that failed to align: 58250729 (96.87%)
                        Reported 4232170 paired-end alignments to 1 output stream(s)
                        Time searching: 16:45:30
                        Overall time: 16:45:30

                        Comment

                        Latest Articles

                        Collapse

                        • SEQadmin2
                          From Collection to Sequencing: Why Sample Preparation and Preservation Define Sequencing Data
                          by SEQadmin2


                          Data variability is still an issue in sequencing technologies despite the advances in reproducibility and accuracy of these platforms. But the problem does not originate in the sequencing itself, but in the previous steps, before the sample reaches the sequencer.


                          The first step is collection, followed by preservation and sample preparation for analysis. Most scientists overlook those steps, but not being careful might just be skewing the experiment’s results.
                          ...
                          06-02-2026, 10:05 AM
                        • SEQadmin2
                          Single-Cell Sequencing at an Inflection Point: Early Impacts of New Platforms and Emerging Trends
                          by SEQadmin2


                          With the launch of new single-cell sequencing platforms in 2026, the field stands at an exciting inflection point. This article surveys the most impactful advances in the field and discusses how they’re reshaping research in cancer, immunology, and beyond.


                          Introduction

                          Single-cell sequencing technologies have undergone remarkable advances over the past decade, transitioning from low-throughput experimental approaches to highly scalable platforms capable of...
                          05-22-2026, 06:42 AM
                        • SEQadmin2
                          Environmental Genomics in the Age of NGS: From Microbes to Conservation Strategies
                          by SEQadmin2

                          Studying ecosystems means dealing with complex, multi-species communities that are hard to observe at scale. This complexity, however, hides many important questions to be answered, from how biogeochemical cycles work and how climate change can affect species distribution to how conservation strategies can work best.


                          Genomics, particularly since the expansion of NGS, has transformed ecosystem ecology. By sequencing environmental DNA, we can now assess biodiversity without direct...
                          05-06-2026, 09:04 AM

                        ad_right_rmr

                        Collapse

                        News

                        Collapse

                        Topics Statistics Last Post
                        Started by SEQadmin2, Today, 08:59 AM
                        0 responses
                        8 views
                        0 reactions
                        Last Post SEQadmin2  
                        Started by SEQadmin2, 06-02-2026, 12:03 PM
                        0 responses
                        21 views
                        0 reactions
                        Last Post SEQadmin2  
                        Started by SEQadmin2, 06-02-2026, 11:40 AM
                        0 responses
                        17 views
                        0 reactions
                        Last Post SEQadmin2  
                        Started by SEQadmin2, 05-28-2026, 11:40 AM
                        0 responses
                        29 views
                        0 reactions
                        Last Post SEQadmin2  
                        Working...