Unconfigured Ad

Collapse
X
 
  • Filter
  • Time
  • Show
Clear All
new posts
  • abelhj
    Junior Member
    • Dec 2009
    • 4

    Samtools/bcftools problem

    I'm using samtools to look for SNPs with ultradeep (>18,000X) coverage. For some reason samtools and bcftools does not call some of the SNPs.

    Code:
    gi|251831106|ref|NC_012920.1|	16125	.	G	.	99	.	DP=18552;AF1=0;CI95=1.5,0;DP4=832,646,0,1;MQ=35;PV4=0.44,0.14,1,1	PL	0
    gi|251831106|ref|NC_012920.1|	16126	.	T	C	99	.	DP=18770;AF1=1;CI95=1,1;DP4=28,21,777,596;MQ=35;PV4=1,0,0.00042,0.28	PL	219,255,0
    For example, at position 16215, IGV gives me:
    Total count: 18522
    A: 17030 (92%, 8124+, 8906-)
    C: 0
    G: 1516 (8%, 861+, 655-)
    T: 5 (0%, 4+, 1-)
    N:1 (0%, 1+, 0-)

    Sam/bcftools should be making a SNP call at position 16215 but is not. Also, the first two counts in DP4 almost agree between bcftools and IGV, but the third and fourth do not. The sequences are quality trimmed and the reads containing the A allele are not of low quality.

    Anyone know why? Thanks.
  • cboustred
    Member
    • May 2010
    • 11

    #2
    Did you get any response to your post?

    I have similar data with ultra deep coverage where the total depth counts seen in IGV match the bcftools DP but the alternate allele counts are close but do not match those in the DP4 bcftools output.

    I assume this is to do with the quality of the calls but I thought I had disabled this with -BQ0 when generating the mpileup

    Any ideas?

    Thanks

    Comment

    • dariober
      Senior Member
      • May 2010
      • 311

      #3
      Originally posted by abelhj View Post
      I'm using samtools to look for SNPs with ultradeep (>18,000X) coverage. For some reason samtools and bcftools does not call some of the SNPs.
      Hi,

      Can you post the commands of samtools/bcftools you are using? If you use vcfutils.pl varFilter there is a -D option that does not call SNPs exceeding D depth (the default is 10000000 so probably this is not the problem if you didn't change it!)

      Sorry I can't be of more help...

      All the best
      Dario

      Comment

      • cboustred
        Member
        • May 2010
        • 11

        #4
        Hi, thanks for the quick response

        I am simply running

        samtools mpileup -l site.txt -BQ0 -d1000000 -uf ref.fasta test.sorted.bam > test_mpileup.bcf

        bcftools view -Acg test_mpileup.bcf

        I am just after the number of reads for each allele at a specific position, like the display you can get in IGV that the OP showed

        Comment

        • swbarnes2
          Senior Member
          • May 2008
          • 910

          #5
          Maybe try:

          mpileup -l site.txt -BQ0 -d1000000 ref.fasta test.sorted.bam > test_mpileup.pileup

          To see what the quality of your alternate letters are according to mpileup. Then look at the individual sam entries for those reads. Maybe there's a discrepancy there.

          Comment

          • cboustred
            Member
            • May 2010
            • 11

            #6
            Thanks for the reply, I've just come across VarScan (http://varscan.sourceforge.net/index.html) which works on the pileup output from samtools.

            Using this I am able to get the counts per allele per position in the pileup which also matches the IGV calls

            All the best

            Chris

            Comment

            Latest Articles

            Collapse

            • GATTACAT
              Reply to Nine Things a Sample Prep Scientist Thinks About Before Sequencing
              by GATTACAT
              Love this - good data definitely starts from good input, and poor input can only give relatively poor data. I particularly like the mention of Nanodrop/absorbance based methods for quantification. It's such a toss up if you'll get an accurate reading or what amounts to a randomly generated number, and a lot of library/sequencing related issues can be traced back to poor quant.
              07-01-2026, 11:43 AM
            • SEQadmin2
              Nine Things a Sample Prep Scientist Thinks About Before Sequencing
              by SEQadmin2


              I’m not a sequencing expert. I’m a purification scientist who uses NGS to evaluate workflows my group develops. With this perspective, we think about the sample first and the NGS workflow second. The sequencer is an exceptionally honest reporter, but it can only report on what you give it, so whether you get clean, interpretable data from an NGS workflow is largely determined before you begin.

              Here are nine questions we think about, in roughly the order they matter, before...
              06-18-2026, 07:11 AM

            ad_right_rmr

            Collapse

            News

            Collapse

            Topics Statistics Last Post
            Started by SEQadmin2, Yesterday, 11:08 AM
            0 responses
            7 views
            0 reactions
            Last Post SEQadmin2  
            Started by SEQadmin2, 06-30-2026, 05:37 AM
            0 responses
            11 views
            0 reactions
            Last Post SEQadmin2  
            Started by SEQadmin2, 06-26-2026, 11:10 AM
            0 responses
            19 views
            0 reactions
            Last Post SEQadmin2  
            Started by SEQadmin2, 06-17-2026, 06:09 AM
            0 responses
            53 views
            0 reactions
            Last Post SEQadmin2  
            Working...