Seqanswers Leaderboard Ad

Collapse

Announcement

Collapse
No announcement yet.
X
 
  • Filter
  • Time
  • Show
Clear All
new posts

  • Bowtie mapping quality in SAM

    Hi, how do you interpret the quality of the alignment in SAM files generated by bowtie? I only see values being 0 or 255 (for unmapped and mapped sequences), is this just me?

    d

  • #2
    Originally posted by dawe View Post
    Hi, how do you interpret the quality of the alignment in SAM files generated by bowtie? I only see values being 0 or 255 (for unmapped and mapped sequences), is this just me?

    d
    Bowtie doesn't calculate Maq-like mapping quality values, so (per the SAM spec) it prints 255 there if the read aligns or 0 otherwise.

    Ben

    Comment


    • #3
      Ben,
      So, does Bowtie give its own mapping quality values somewhere or it does not report any mapping quality values at all? As the manual said, --best option can report best-worst order alignments, how can I measure the level between a better alignment to a worse alignment without mapping quality? Thanks.

      Comment


      • #4
        Originally posted by xinwu View Post
        Ben,
        So, does Bowtie give its own mapping quality values somewhere or it does not report any mapping quality values at all? As the manual said, --best option can report best-worst order alignments, how can I measure the level between a better alignment to a worse alignment without mapping quality? Thanks.
        I'm also interested in this question...

        Dario

        Comment


        • #5
          Originally posted by dawe View Post
          Hi, how do you interpret the quality of the alignment in SAM files generated by bowtie? I only see values being 0 or 255 (for unmapped and mapped sequences), is this just me?

          d
          Hi,

          I have the same problem when aligning reads using Tophat.

          The values in my sam file generated by tophat range from 0 to 255, which is different from your example.

          I look up the field interpretation for the quality value by reading The SAM Format Speci cation, which tells me that in the attached snapshot.

          The value ranges from 0 to +∞ with that the zero represents the highest probability of mapping wrong and +∞ is for the lowest probability of mapping wrong, according to the formula given in the document. The range 0~+∞ could be normalized into 0~255, right?

          But why does the document tell that the value 255 indicates that the mapping quality is not available? I have an inverse answer to it.

          I am wondering if I have made a mistake of understanding.

          Hope someone could help me.

          Thanks in advance,
          Attached Files

          Comment


          • #6
            Originally posted by Hunny View Post
            Hi,

            I have the same problem when aligning reads using Tophat.

            The values in my sam file generated by tophat range from 0 to 255, which is different from your example.

            I look up the field interpretation for the quality value by reading The SAM Format Speci cation, which tells me that in the attached snapshot.

            The value ranges from 0 to +∞ with that the zero represents the highest probability of mapping wrong and +∞ is for the lowest probability of mapping wrong, according to the formula given in the document. The range 0~+∞ could be normalized into 0~255, right?

            But why does the document tell that the value 255 indicates that the mapping quality is not available? I have an inverse answer to it.

            I am wondering if I have made a mistake of understanding.

            Hope someone could help me.

            Thanks in advance,
            Well, many facts manifest that the higher the value, the better the alignment with 255 of its maximum. So, I guess "mapping quality is not available" means the value 255 is meaningless for such a perfect alignment, right?

            Comment


            • #7
              Would like to revisit this. Is it clear now that 255 means a top score alignment?

              Comment


              • #8
                Found this elsewhere on the web>

                Tophat/bowtie don’t report mapping quality values that are as meaningful as BWA, but there is some information in the mapping quality values tophat reports. Tophat yields 4 distinct values for its mapping quality values (you can do a “unique” count on the mapping quality field of any SAM file from tophat to verify this):



                255 = unique mapping

                3 = maps to 2 locations in the target

                2 = maps to 3 locations

                1 = maps to 4-9 locations

                0 = maps to 10 or more locations.



                Except for the 255 case, the simple rule that was encoded by the authors is the usual phred quality scale:



                MapQ = -10 log10(P)



                Where P = probability that this mapping is NOT the correct one. The authors ignore the number of mismatches in this calculation and simply assume that if it maps to 2 locations then P = 0.5, 3 locations implies P = 2/3, 4 locations => P = 3/4 etc.



                As you can clearly see, then MapQ = -10 log10(0.5) = 3; -10 log10(2/3) = 1.76 (rounds to 2);

                -10 log10(3/4) = 1.25 (rounds to 1), etc.

                Comment


                • #9
                  Thanks carmeyeii. That is very helpful!

                  Comment


                  • #10
                    BWA or bowtie2 report actual mapping qualities, which can be very useful, especially if you are analyzing ChIP-Seq or do SNP calling. Bowtie does not.

                    Comment

                    Latest Articles

                    Collapse

                    • seqadmin
                      Best Practices for Single-Cell Sequencing Analysis
                      by seqadmin



                      While isolating and preparing single cells for sequencing was historically the bottleneck, recent technological advancements have shifted the challenge to data analysis. This highlights the rapidly evolving nature of single-cell sequencing. The inherent complexity of single-cell analysis has intensified with the surge in data volume and the incorporation of diverse and more complex datasets. This article explores the challenges in analysis, examines common pitfalls, offers...
                      Yesterday, 07:15 AM
                    • seqadmin
                      Latest Developments in Precision Medicine
                      by seqadmin



                      Technological advances have led to drastic improvements in the field of precision medicine, enabling more personalized approaches to treatment. This article explores four leading groups that are overcoming many of the challenges of genomic profiling and precision medicine through their innovative platforms and technologies.

                      Somatic Genomics
                      “We have such a tremendous amount of genetic diversity that exists within each of us, and not just between us as individuals,”...
                      05-24-2024, 01:16 PM

                    ad_right_rmr

                    Collapse

                    News

                    Collapse

                    Topics Statistics Last Post
                    Started by seqadmin, Yesterday, 08:18 AM
                    0 responses
                    13 views
                    0 likes
                    Last Post seqadmin  
                    Started by seqadmin, Yesterday, 08:04 AM
                    0 responses
                    12 views
                    0 likes
                    Last Post seqadmin  
                    Started by seqadmin, 06-03-2024, 06:55 AM
                    0 responses
                    13 views
                    0 likes
                    Last Post seqadmin  
                    Started by seqadmin, 05-30-2024, 03:16 PM
                    0 responses
                    27 views
                    0 likes
                    Last Post seqadmin  
                    Working...
                    X