Unconfigured Ad

Collapse
X
 
  • Filter
  • Time
  • Show
Clear All
new posts
  • skblazer
    Member
    • Feb 2009
    • 51

    Hi, Ben Langmead
    Recently I used bowtie to align some solexa reads in 100bp length. But the matched ratio is low. I want to know what's your recommendation for the option parameters for the long reads such as 100bp?

    Thank you

    Comment

    • cp229
      Junior Member
      • Jun 2010
      • 5

      Hi Ben, I am a very new beginner with bowtie. I have a question about it, hope you can help me out. When I run bowtie with 454 long reads(the read length is about 200bp), the result is very weird that nearly 98% of reads are mismatching. (I tried human, yeast and e_coli). Could you tell me that whether I can use bowtie to align the long read or not....Thx

      Comment

      • Xi Wang
        Senior Member
        • Oct 2009
        • 317

        Originally posted by cp229 View Post
        Hi Ben, I am a very new beginner with bowtie. I have a question about it, hope you can help me out. When I run bowtie with 454 long reads(the read length is about 200bp), the result is very weird that nearly 98% of reads are mismatching. (I tried human, yeast and e_coli). Could you tell me that whether I can use bowtie to align the long read or not....Thx
        The 454 reads may contain indel errors while sequencing. Also, longer reads may cover more indel variants in the reference genome. I think the indels could be the main reason why bowtie reported a low mappable ratio.
        Xi Wang

        Comment

        • Subho
          Junior Member
          • Apr 2010
          • 2

          Is it possible to get mapping quality and read quality (a single value per read, e.g. calculated out of base quality) for Bowtie?

          Comment

          • didymos
            Junior Member
            • Jun 2010
            • 8

            Originally posted by Ben Langmead View Post
            I think the problem is that you're assuming -n 0 -l 16 is going to allow any alignment with no mismatches in the first 16 colors, but the -e limit also applies. -e also needs to be set high enough so that the sum of the quality values of all the mismatched positions in your example are still <= the -e limit. If no quality values are supplied, qualities all = 30.

            Hope that helps. You may also want to consider using trimming (e.g. -3).

            Thanks,
            Ben
            Thank you Ben! and sorry for late response...
            Now it works as I want
            Best!

            tomek

            Comment

            • cp229
              Junior Member
              • Jun 2010
              • 5

              I think I`ve got that now!!Thank you very much Xi Wang.

              Comment

              • yeb3czg
                Junior Member
                • Feb 2010
                • 2

                Hi all

                I got almost the same problem with bowtie. I tried to align paired end read in fastaQ format of 84 bp
                >bowtie -q -t --solexa1.3-quals -p 2 --sam xx_index.txt -1 xx.txt -2 xx.txt -m 1 > xx.sam

                but I got the message
                "Reads file contained a pattern with more than 1024 sequence characters.
                Please truncate reads and quality values and and re-run Bowtie
                terminate called after throwing an instance of 'int'
                Aborted"

                but if I'm using one file of the paired end reads as single end for alignment it works

                There is no uncalled bases in my reads
                Any idea about this issue ?

                Thank you

                AC

                Comment

                • Xi Wang
                  Senior Member
                  • Oct 2009
                  • 317

                  Originally posted by yeb3czg View Post
                  Hi all

                  I got almost the same problem with bowtie. I tried to align paired end read in fastaQ format of 84 bp
                  >bowtie -q -t --solexa1.3-quals -p 2 --sam xx_index.txt -1 xx.txt -2 xx.txt -m 1 > xx.sam

                  but I got the message
                  "Reads file contained a pattern with more than 1024 sequence characters.
                  Please truncate reads and quality values and and re-run Bowtie
                  terminate called after throwing an instance of 'int'
                  Aborted"

                  but if I'm using one file of the paired end reads as single end for alignment it works

                  There is no uncalled bases in my reads
                  Any idea about this issue ?

                  Thank you

                  AC
                  Have you try to put the option -m ahead, like this?

                  Code:
                  bowtie  -m 1 -q -t --solexa1.3-quals -p 2 --sam xx_index.txt -1 xx.txt -2 xx.txt > xx.sam
                  Xi Wang

                  Comment

                  • yeb3czg
                    Junior Member
                    • Feb 2010
                    • 2

                    Xi
                    I tried it didn't change anything but I found what was the problem in one of my reads file the format of one lane was not correct
                    now it 's working
                    thanks

                    Comment

                    • didymos
                      Junior Member
                      • Jun 2010
                      • 8

                      bowtie and solid

                      Hi,
                      I have another question about solid data...:
                      I have two different reads:
                      T32221113212031121021022123023302010
                      and
                      T22221113212031121021022123023302010
                      In color space difference is only in one number - so in the base space those sequences are completely different, however with bowtie they are mapped to the same seq:

                      bowtie -a -n 0 -C ../indeks/miRNA-mature_cs -3 8 -c T32221113212031121021022123023302010
                      0 + mmT-miR-1944 1 TCTGTGCTGAATGTCAAGTTCTGAT qqqqqqqqqqqqqqqqqqqqqqqqq 0


                      bowtie -a -n 0 -C ../indeks/miRNA-mature_cs -3 8 -c T22221113212031121021022123023302010
                      0 + mmT-miR-1944 1 TCTGTGCTGAATGTCAAGTTCTGAT qqqqqqqqqqqqqqqqqqqqqqqqq 0

                      Why?
                      Thanks for any suggestions!
                      Best!

                      tomek

                      Comment

                      • nilshomer
                        Nils Homer
                        • Nov 2008
                        • 1283

                        Originally posted by didymos View Post
                        Hi,
                        I have another question about solid data...:
                        I have two different reads:
                        T32221113212031121021022123023302010
                        and
                        T22221113212031121021022123023302010
                        In color space difference is only in one number - so in the base space those sequences are completely different, however with bowtie they are mapped to the same seq:

                        bowtie -a -n 0 -C ../indeks/miRNA-mature_cs -3 8 -c T32221113212031121021022123023302010
                        0 + mmT-miR-1944 1 TCTGTGCTGAATGTCAAGTTCTGAT qqqqqqqqqqqqqqqqqqqqqqqqq 0


                        bowtie -a -n 0 -C ../indeks/miRNA-mature_cs -3 8 -c T22221113212031121021022123023302010
                        0 + mmT-miR-1944 1 TCTGTGCTGAATGTCAAGTTCTGAT qqqqqqqqqqqqqqqqqqqqqqqqq 0

                        Why?
                        Thanks for any suggestions!
                        Best!

                        tomek
                        For one of the two sequences, the first color was probably identified as a color error (sequencing error) and corrected appropriately. Color errors (sequencing error) and base differences (SNPs) manifest differently. See the attached PDF for a brief explanation between the differences.
                        Attached Files

                        Comment

                        • didymos
                          Junior Member
                          • Jun 2010
                          • 8

                          Originally posted by nilshomer View Post
                          For one of the two sequences, the first color was probably identified as a color error (sequencing error) and corrected appropriately. Color errors (sequencing error) and base differences (SNPs) manifest differently. See the attached PDF for a brief explanation between the differences.
                          Thank you!
                          very clear presentation. However in my case things are little bit different.
                          I am mapping not to the whole genome but only to the miRNA sequences (previously indexed with bowtie-build). If I understand correctly color is identified as color error when after correction sequence can be easily mapped, but in my case this "error" sequence can be just not a miRNA sequence - other RNA type from RNA-seq experiment.
                          Question is how many color errors are allowed to be correct in bowtie - how I can set it or how can I switch it off if possible?
                          Thanks!

                          tomek

                          Comment

                          • laghs
                            Junior Member
                            • Jul 2010
                            • 2

                            Missed alignment?

                            I noticed that sometimes when I use Bowtie, if the only mismatch is at the 5' end, then no alignment will be produced. Could this be a bug from index building? Here is an example below. With only a single mismatch at the very 5' end, I expect to see the read aligned with -v 2. However, that is not the case. Could someone tell me why and how this can be corrected? Many thanks in advance.

                            ../bowtie-0.12.5/bowtie -t refseq_hs -c CGACTCTTAGCGGTGGATCACTCGG -v 2
                            # reads processed: 1
                            # reads with at least one reported alignment: 0 (0.00%)
                            # reads that failed to align: 1 (100.00%)
                            No alignments


                            ../bowtie-0.12.5/bowtie -t refseq_hs -c GACTCTTAGCGGTGGATCACTCGG -v 2
                            0 + gi|142372596|ref|NR_003285.2| 0 GACTCTTAGCGGTGGATCACTCGG IIIIIIIIIIIIIIIIIIIIIIII 0
                            # reads processed: 1
                            # reads with at least one reported alignment: 1 (100.00%)
                            # reads that failed to align: 0 (0.00%)
                            Reported 1 alignments to 1 output stream(s)

                            Comment

                            • Xi Wang
                              Senior Member
                              • Oct 2009
                              • 317

                              Originally posted by laghs View Post
                              I noticed that sometimes when I use Bowtie, if the only mismatch is at the 5' end, then no alignment will be produced. Could this be a bug from index building? Here is an example below. With only a single mismatch at the very 5' end, I expect to see the read aligned with -v 2. However, that is not the case. Could someone tell me why and how this can be corrected? Many thanks in advance.

                              ../bowtie-0.12.5/bowtie -t refseq_hs -c CGACTCTTAGCGGTGGATCACTCGG -v 2
                              # reads processed: 1
                              # reads with at least one reported alignment: 0 (0.00%)
                              # reads that failed to align: 1 (100.00%)
                              No alignments


                              ../bowtie-0.12.5/bowtie -t refseq_hs -c GACTCTTAGCGGTGGATCACTCGG -v 2
                              0 + gi|142372596|ref|NR_003285.2| 0 GACTCTTAGCGGTGGATCACTCGG IIIIIIIIIIIIIIIIIIIIIIII 0
                              # reads processed: 1
                              # reads with at least one reported alignment: 1 (100.00%)
                              # reads that failed to align: 0 (0.00%)
                              Reported 1 alignments to 1 output stream(s)
                              I think your case was due to Bowtie cann't deal with indel, but not because of 5' mismatches. Note that the aligned position for read GACTCTTAGCGGTGGATCACTCGG (your 2nd read) is 0 (0-based), so for your 1st read: CGACTCTTAGCGGTGGATCACTCGG, it was not able to map to the reference sequences.
                              Xi Wang

                              Comment

                              • laghs
                                Junior Member
                                • Jul 2010
                                • 2

                                Originally posted by Xi Wang View Post
                                I think your case was due to Bowtie cann't deal with indel, but not because of 5' mismatches. Note that the aligned position for read GACTCTTAGCGGTGGATCACTCGG (your 2nd read) is 0 (0-based), so for your 1st read: CGACTCTTAGCGGTGGATCACTCGG, it was not able to map to the reference sequences.
                                I see. Is there a way to get around this problem (other than using another alignment program)? Thanks.

                                Comment

                                Latest Articles

                                Collapse

                                • SEQadmin2
                                  From Collection to Sequencing: Why Sample Preparation and Preservation Define Sequencing Data
                                  by SEQadmin2


                                  Data variability is still an issue in sequencing technologies despite the advances in reproducibility and accuracy of these platforms. But the problem does not originate in the sequencing itself, but in the previous steps, before the sample reaches the sequencer.


                                  The first step is collection, followed by preservation and sample preparation for analysis. Most scientists overlook those steps, but not being careful might just be skewing the experiment’s results.
                                  ...
                                  06-02-2026, 10:05 AM
                                • SEQadmin2
                                  Single-Cell Sequencing at an Inflection Point: Early Impacts of New Platforms and Emerging Trends
                                  by SEQadmin2


                                  With the launch of new single-cell sequencing platforms in 2026, the field stands at an exciting inflection point. This article surveys the most impactful advances in the field and discusses how they’re reshaping research in cancer, immunology, and beyond.


                                  Introduction

                                  Single-cell sequencing technologies have undergone remarkable advances over the past decade, transitioning from low-throughput experimental approaches to highly scalable platforms capable of...
                                  05-22-2026, 06:42 AM
                                • SEQadmin2
                                  Environmental Genomics in the Age of NGS: From Microbes to Conservation Strategies
                                  by SEQadmin2

                                  Studying ecosystems means dealing with complex, multi-species communities that are hard to observe at scale. This complexity, however, hides many important questions to be answered, from how biogeochemical cycles work and how climate change can affect species distribution to how conservation strategies can work best.


                                  Genomics, particularly since the expansion of NGS, has transformed ecosystem ecology. By sequencing environmental DNA, we can now assess biodiversity without direct...
                                  05-06-2026, 09:04 AM

                                ad_right_rmr

                                Collapse

                                News

                                Collapse

                                Topics Statistics Last Post
                                Started by SEQadmin2, 06-02-2026, 12:03 PM
                                0 responses
                                19 views
                                0 reactions
                                Last Post SEQadmin2  
                                Started by SEQadmin2, 06-02-2026, 11:40 AM
                                0 responses
                                14 views
                                0 reactions
                                Last Post SEQadmin2  
                                Started by SEQadmin2, 05-28-2026, 11:40 AM
                                0 responses
                                29 views
                                0 reactions
                                Last Post SEQadmin2  
                                Started by SEQadmin2, 05-26-2026, 10:12 AM
                                0 responses
                                31 views
                                0 reactions
                                Last Post SEQadmin2  
                                Working...