Seqanswers Leaderboard Ad

Collapse

Announcement

Collapse
No announcement yet.
X
 
  • Filter
  • Time
  • Show
Clear All
new posts

  • parameter set of bwa-samtools for snp calling

    hi, everyone! I use bwa-samtools for my rice genome snp calling recently, However I found in the "mpileup" step, the snp calling result is significantly different according to the related parameter set.eg. no -6A, the variation is 66041, however, used -6A ,the variation is 181319(triple than before), is that due to the anomalous reads?
    The result let me confused on how to set the parameter to get the accurate result?

  • #2
    I have tested, that's due to the -6, it means Illumina 1.3+ format. why?
    I have check my fastq file, it is Illumina 1.5+ format, should I add -6?
    All the reply is appreciated!

    Comment


    • #3
      The answer is yes, you should use the -6 parameter if your reads are in Illumina 1.3+ format. Also when aligning with bwa did you use the -I option? If not your mapping qualities may not be accurate and that may affect the SNP calling

      Comment


      • #4
        Originally posted by vv85 View Post
        The answer is yes, you should use the -6 parameter if your reads are in Illumina 1.3+ format. Also when aligning with bwa did you use the -I option? If not your mapping qualities may not be accurate and that may affect the SNP calling
        Could you say more in detail?

        Comment


        • #5
          For more info on the two different fastq formats you can use the wiki page:


          So for example, let's say you have this:
          @HWUSI-232323
          ATTTTA
          +
          BBBBBB

          If the format is Phred+33 then you have a good read. If the format is Phred+64 then you have a bad read. Mapping qualities depend on base qualities so by not specifying the actual format mapping qualities will not reflect correctly what's in the data. Since SNP callers use this MAPQ for variant calls, this leads to differences.

          Comment


          • #6
            parameter set of bwa-samtools for snp calling

            Hi,
            Everyone, I use bwa-samtools for human genome SNPs calling, but the SRA READS are performed using the Illumina Genome Analyzer Pipeline v.1.4.0 software suite. Should I need to run BWA alignment without -I or -L or -J options for diferent illumina versions. When I ran with -I option there was error saying sequence and quality are inconsistent. I would be glad and appreciate for your consideration.

            Rocky

            Comment


            • #7
              I wouldn't think that specific problem is related to the parameter. Do you get a different result without using the -I option? At which step does the error occur?

              Comment


              • #8
                Originally posted by vv85 View Post
                I wouldn't think that specific problem is related to the parameter. Do you get a different result without using the -I option? At which step does the error occur?
                hi,vv85
                I had checked the result of using -I and without using -I when run bwa, there is no difference in sam format report, except that when using -I , it converted the quality score of Illumina format into sanger format in the reads quality column. However, without using -I, it didn't do this job.

                Comment


                • #9
                  That's more or less what's expected, depending on the overall quality of your reads some mapping qualities could be changed too. With these reads now should be able to use samtools without the -6 and get (somewhat) confident SNPs

                  Comment


                  • #10
                    When I try running with -I option at BWA alignment and converted to sam file then to bam file error occurs.

                    These are the comment
                    SEQUENCE ALIGNMENT
                    $ ./bwa aln -I -t 4 /reference_genome/hg18bwaidx ~/SRR000001_1.fastq > ~/SRR000001_1.sai
                    $ ./bwa aln -I -t 4 /reference_genome/hg18bwaidx ~/SRR000001_2.fastq > ~/SRR000001_2.sai


                    PARIED-END MAPPING
                    $ ./bwa sampe /reference_genome/hg18bwaidx ~/bwa-0.5.9/SRR000001_1.sai ~/bwa-0.5.9/SRR000001_2.sai ~/SRR000001_1.fastq ~/SRR000001_2.fastq > ~/SRR000001.sam


                    CONVERT TO BAM FILES
                    $ ./samtools view -b -h -S /SRR000001.sam > ~/SRR000001.bam

                    ERROR message - sequence and quality are inconsistent

                    Comment


                    • #11
                      paired-end reads were obtained from GAII (Illumina Genome Analyzer Pipeline v.1.4.0)

                      Comment

                      Latest Articles

                      Collapse

                      • seqadmin
                        Exploring the Dynamics of the Tumor Microenvironment
                        by seqadmin




                        The complexity of cancer is clearly demonstrated in the diverse ecosystem of the tumor microenvironment (TME). The TME is made up of numerous cell types and its development begins with the changes that happen during oncogenesis. “Genomic mutations, copy number changes, epigenetic alterations, and alternative gene expression occur to varying degrees within the affected tumor cells,” explained Andrea O’Hara, Ph.D., Strategic Technical Specialist at Azenta. “As...
                        07-08-2024, 03:19 PM
                      • seqadmin
                        Exploring Human Diversity Through Large-Scale Omics
                        by seqadmin


                        In 2003, researchers from the Human Genome Project (HGP) announced the most comprehensive genome to date1. Although the genome wasn’t fully completed until nearly 20 years later2, numerous large-scale projects, such as the International HapMap Project and 1000 Genomes Project, continued the HGP's work, capturing extensive variation and genomic diversity within humans. Recently, newer initiatives have significantly increased in scale and expanded beyond genomics, offering a more detailed...
                        06-25-2024, 06:43 AM

                      ad_right_rmr

                      Collapse

                      News

                      Collapse

                      Topics Statistics Last Post
                      Started by seqadmin, 07-19-2024, 07:20 AM
                      0 responses
                      25 views
                      0 likes
                      Last Post seqadmin  
                      Started by seqadmin, 07-16-2024, 05:49 AM
                      0 responses
                      41 views
                      0 likes
                      Last Post seqadmin  
                      Started by seqadmin, 07-15-2024, 06:53 AM
                      0 responses
                      46 views
                      0 likes
                      Last Post seqadmin  
                      Started by seqadmin, 07-10-2024, 07:30 AM
                      0 responses
                      42 views
                      0 likes
                      Last Post seqadmin  
                      Working...
                      X