Seqanswers Leaderboard Ad

Collapse

Announcement

Collapse
No announcement yet.
X
 
  • Filter
  • Time
  • Show
Clear All
new posts

  • parameter set of bwa-samtools for snp calling

    hi, everyone! I use bwa-samtools for my rice genome snp calling recently, However I found in the "mpileup" step, the snp calling result is significantly different according to the related parameter set.eg. no -6A, the variation is 66041, however, used -6A ,the variation is 181319(triple than before), is that due to the anomalous reads?
    The result let me confused on how to set the parameter to get the accurate result?

  • #2
    I have tested, that's due to the -6, it means Illumina 1.3+ format. why?
    I have check my fastq file, it is Illumina 1.5+ format, should I add -6?
    All the reply is appreciated!

    Comment


    • #3
      The answer is yes, you should use the -6 parameter if your reads are in Illumina 1.3+ format. Also when aligning with bwa did you use the -I option? If not your mapping qualities may not be accurate and that may affect the SNP calling

      Comment


      • #4
        Originally posted by vv85 View Post
        The answer is yes, you should use the -6 parameter if your reads are in Illumina 1.3+ format. Also when aligning with bwa did you use the -I option? If not your mapping qualities may not be accurate and that may affect the SNP calling
        Could you say more in detail?

        Comment


        • #5
          For more info on the two different fastq formats you can use the wiki page:


          So for example, let's say you have this:
          @HWUSI-232323
          ATTTTA
          +
          BBBBBB

          If the format is Phred+33 then you have a good read. If the format is Phred+64 then you have a bad read. Mapping qualities depend on base qualities so by not specifying the actual format mapping qualities will not reflect correctly what's in the data. Since SNP callers use this MAPQ for variant calls, this leads to differences.

          Comment


          • #6
            parameter set of bwa-samtools for snp calling

            Hi,
            Everyone, I use bwa-samtools for human genome SNPs calling, but the SRA READS are performed using the Illumina Genome Analyzer Pipeline v.1.4.0 software suite. Should I need to run BWA alignment without -I or -L or -J options for diferent illumina versions. When I ran with -I option there was error saying sequence and quality are inconsistent. I would be glad and appreciate for your consideration.

            Rocky

            Comment


            • #7
              I wouldn't think that specific problem is related to the parameter. Do you get a different result without using the -I option? At which step does the error occur?

              Comment


              • #8
                Originally posted by vv85 View Post
                I wouldn't think that specific problem is related to the parameter. Do you get a different result without using the -I option? At which step does the error occur?
                hi,vv85
                I had checked the result of using -I and without using -I when run bwa, there is no difference in sam format report, except that when using -I , it converted the quality score of Illumina format into sanger format in the reads quality column. However, without using -I, it didn't do this job.

                Comment


                • #9
                  That's more or less what's expected, depending on the overall quality of your reads some mapping qualities could be changed too. With these reads now should be able to use samtools without the -6 and get (somewhat) confident SNPs

                  Comment


                  • #10
                    When I try running with -I option at BWA alignment and converted to sam file then to bam file error occurs.

                    These are the comment
                    SEQUENCE ALIGNMENT
                    $ ./bwa aln -I -t 4 /reference_genome/hg18bwaidx ~/SRR000001_1.fastq > ~/SRR000001_1.sai
                    $ ./bwa aln -I -t 4 /reference_genome/hg18bwaidx ~/SRR000001_2.fastq > ~/SRR000001_2.sai


                    PARIED-END MAPPING
                    $ ./bwa sampe /reference_genome/hg18bwaidx ~/bwa-0.5.9/SRR000001_1.sai ~/bwa-0.5.9/SRR000001_2.sai ~/SRR000001_1.fastq ~/SRR000001_2.fastq > ~/SRR000001.sam


                    CONVERT TO BAM FILES
                    $ ./samtools view -b -h -S /SRR000001.sam > ~/SRR000001.bam

                    ERROR message - sequence and quality are inconsistent

                    Comment


                    • #11
                      paired-end reads were obtained from GAII (Illumina Genome Analyzer Pipeline v.1.4.0)

                      Comment

                      Latest Articles

                      Collapse

                      • seqadmin
                        Recent Developments in Metagenomics
                        by seqadmin





                        Metagenomics has improved the way researchers study microorganisms across diverse environments. Historically, studying microorganisms relied on culturing them in the lab, a method that limits the investigation of many species since most are unculturable1. Metagenomics overcomes these issues by allowing the study of microorganisms regardless of their ability to be cultured or the environments they inhabit. Over time, the field has evolved, especially with the advent...
                        09-23-2024, 06:35 AM
                      • seqadmin
                        Understanding Genetic Influence on Infectious Disease
                        by seqadmin




                        During the COVID-19 pandemic, scientists observed that while some individuals experienced severe illness when infected with SARS-CoV-2, others were barely affected. These disparities left researchers and clinicians wondering what causes the wide variations in response to viral infections and what role genetics plays.

                        Jean-Laurent Casanova, M.D., Ph.D., Professor at Rockefeller University, is a leading expert in this crossover between genetics and infectious...
                        09-09-2024, 10:59 AM

                      ad_right_rmr

                      Collapse

                      News

                      Collapse

                      Topics Statistics Last Post
                      Started by seqadmin, 10-02-2024, 04:51 AM
                      0 responses
                      13 views
                      0 likes
                      Last Post seqadmin  
                      Started by seqadmin, 10-01-2024, 07:10 AM
                      0 responses
                      20 views
                      0 likes
                      Last Post seqadmin  
                      Started by seqadmin, 09-30-2024, 08:33 AM
                      0 responses
                      25 views
                      0 likes
                      Last Post seqadmin  
                      Started by seqadmin, 09-26-2024, 12:57 PM
                      0 responses
                      18 views
                      0 likes
                      Last Post seqadmin  
                      Working...
                      X