Seqanswers Leaderboard Ad

Collapse

Announcement

Collapse
No announcement yet.
X
 
  • Filter
  • Time
  • Show
Clear All
new posts

  • Quality score after Illumina run - should it be coverted before samtools and gatk?

    Hi all,

    I am a bit confused with the base quality score. After Illumina run, I use BWA to map the reads. I then use downstream applications such as samtools and gatk. But samtools accepts sanger phred (ASCII-33), while Illimina's fastq is ASCII-64. Is it correct that the fastq need to be converted before running BWA to sanger scores? If so, what tools do people use for this?

  • #2
    yes, you should convert first, even before BWA.
    google for fq_all2std.pl

    Comment


    • #3
      thanks a lot!

      Comment


      • #4
        I took a look at the code of fq_all2std.pl . I believe I need the command:
        sol2std Convert Solexa/Illumina FASTQ to the standard FASTQ

        But doesn't it convert the old Illumina (prior to version 1.3) to sanger?

        The conversion table it uses:
        for (-64..64) {
        $conv_table[$_+64] = chr(int(33 + 10*log(1+10**($_/10.0))/log(10)+.499));
        }

        Comment


        • #5
          But doesn't it convert the old Illumina (prior to version 1.3) to sanger?
          Yeah this script converts only solexa format and not illumina(1.3 or 1.5+) one.

          Comment


          • #6
            BWA option -I

            In the recent versions in BWA you no longer need to convert the quality scores. From the BWA manual:

            -I The input is in the Illumina 1.3+ read format (quality equals ASCII-64).

            Comment


            • #7
              So it seems you don't need to convert before BWA. Do you need to convert before GATK or before samtools?

              Comment


              • #8
                There isn't any need to, as all SAM/BAM files have QUAL field: "ASCII of Phred-scaled base QUALity+33". This way you don't need to worry about the quality scale in SAM/BAM.

                (Note that it is possible to get qualities in a SAM/BAM like file that are scaled QUALity+64, BUT these are not real SAM/BAM files. BWA produces a proper SAM/BAM file with the correct quality scale.)

                Comment


                • #9
                  what is the consequence of losing -I option?

                  As asked in the title, what if I missed the -I option while run bwa? is there any remedial work I can do to save it? thx!

                  Comment


                  • #10
                    Current Illumina's qualities in the fastq files changed since the first post on this issue. They are now (Illumina 1.8+) Phred+33, and therefore no need to specify -I for BWA. Just make sure the fastq files you are working with are indeed Illumina 1.8+.

                    Comment

                    Latest Articles

                    Collapse

                    • seqadmin
                      Non-Coding RNA Research and Technologies
                      by seqadmin




                      Non-coding RNAs (ncRNAs) do not code for proteins but play important roles in numerous cellular processes including gene silencing, developmental pathways, and more. There are numerous types including microRNA (miRNA), long ncRNA (lncRNA), circular RNA (circRNA), and more. In this article, we discuss innovative ncRNA research and explore recent technological advancements that improve the study of ncRNAs.

                      Nobel Prize for MicroRNA Discovery
                      This week,...
                      10-07-2024, 08:07 AM
                    • seqadmin
                      Recent Developments in Metagenomics
                      by seqadmin





                      Metagenomics has improved the way researchers study microorganisms across diverse environments. Historically, studying microorganisms relied on culturing them in the lab, a method that limits the investigation of many species since most are unculturable1. Metagenomics overcomes these issues by allowing the study of microorganisms regardless of their ability to be cultured or the environments they inhabit. Over time, the field has evolved, especially with the advent...
                      09-23-2024, 06:35 AM

                    ad_right_rmr

                    Collapse

                    News

                    Collapse

                    Topics Statistics Last Post
                    Started by seqadmin, Today, 06:35 AM
                    0 responses
                    7 views
                    0 likes
                    Last Post seqadmin  
                    Started by seqadmin, Yesterday, 02:44 PM
                    0 responses
                    7 views
                    0 likes
                    Last Post seqadmin  
                    Started by seqadmin, 10-11-2024, 06:55 AM
                    0 responses
                    15 views
                    0 likes
                    Last Post seqadmin  
                    Started by seqadmin, 10-02-2024, 04:51 AM
                    0 responses
                    111 views
                    0 likes
                    Last Post seqadmin  
                    Working...
                    X