Unconfigured Ad

Collapse
X
 
  • Filter
  • Time
  • Show
Clear All
new posts
  • cristae8
    Junior Member
    • Jun 2011
    • 6

    Samtools mpileup/bcftools

    Hi all,
    I read through samtools manuals several times, but I'm still not clear on how exactly samtools & bcftools decide to call a SNP. I've tried to run through multiple combination of arguments with mpileup (-B, -C, -q, etc) & bcftools, but still ran into the problem below. I even ran bcftools view on the bcf file without the varFilter step, but the problem persists.

    I have 2 samples, an original & an "evolved" cell line. Based on numerous runs, I found that there are many SNPs being called only in the "evolved" cell line but not on the original, making it look like they're "novel" SNP. However, when I view them on IGV, I can see the SNP in the original cell line and there don't seem to be significant differences between the mapping quality or base quality at the SNP position in these 2 samples.
    It's not important to me if reads below a certain mapping quality don't get count, but the trouble is that it seems to be inconsistent. In one sample, SNPs on reads having mapping quality of 0 don't get count, but then they would get counted in the other sample, making it difficult to identify the true novel SNP. Is there anyway to force the SNP count to be more consistent?
    -Ann
  • tlynch1
    Junior Member
    • Apr 2012
    • 2

    #2
    Hi,

    I'm also having a similar problem! I have PCR confirmed SNPs that I can see in IGV when I look at the BAM file, however they are not present after varFilter. I'm guessing that there are parameters that I need to change to include them, but there doesn't seem to be any trend in map quality or coverage that I can distinguish ones that are called vs. missed in the vcf output.

    I mapped my PE Illumina reads with BWA and then running samtools mpileup:
    $ samtools mpileup -uf x.fasta Initial-sorted.bam | bcftools/bcftools view -bvcg - > initial.raw.bcf

    Then ran:
    $bcftools view initial.raw.bcf | vcfutils.pl varFilter -D500 > initial_snps.vcf

    I've tried adding -E switch to the samtools mpileup and get alot more SNPs, however this still does not include the SNPs confirmed by PCR. Also tried increasing the -D value when on the bcftools commands - again increases SNP count, but still not ones I know of.

    Please help! Not sure what else to do....

    Comment

    • twaddlac
      Member
      • Feb 2011
      • 49

      #3
      Try VarScan http://varscan.sourceforge.net/

      It takes the 'mpileup -f reference' output (piped or in a file) and then calls variants based on statistical analysis. Their paper does a good job at describing the algorithm. Plus the command line parameters are much more straightforward.

      I hope this helps!

      Comment

      • tlynch1
        Junior Member
        • Apr 2012
        • 2

        #4
        Thanks so much! The VarScan results seemed to agree much better with my pcr data!

        Comment

        Latest Articles

        Collapse

        • SEQadmin2
          Nine Things a Sample Prep Scientist Thinks About Before Sequencing
          by SEQadmin2


          I’m not a sequencing expert. I’m a purification scientist who uses NGS to evaluate workflows my group develops. With this perspective, we think about the sample first and the NGS workflow second. The sequencer is an exceptionally honest reporter, but it can only report on what you give it, so whether you get clean, interpretable data from an NGS workflow is largely determined before you begin.

          Here are nine questions we think about, in roughly the order they matter, before...
          06-18-2026, 07:11 AM
        • SEQadmin2
          From Collection to Sequencing: Why Sample Preparation and Preservation Define Sequencing Data
          by SEQadmin2


          Data variability is still an issue in sequencing technologies despite the advances in reproducibility and accuracy of these platforms. But the problem does not originate in the sequencing itself, but in the previous steps, before the sample reaches the sequencer.


          The first step is collection, followed by preservation and sample preparation for analysis. Most scientists overlook those steps, but not being careful might just be skewing the experiment’s results.
          ...
          06-02-2026, 10:05 AM

        ad_right_rmr

        Collapse

        News

        Collapse

        Topics Statistics Last Post
        Started by SEQadmin2, Today, 05:37 AM
        0 responses
        5 views
        0 reactions
        Last Post SEQadmin2  
        Started by SEQadmin2, 06-26-2026, 11:10 AM
        0 responses
        16 views
        0 reactions
        Last Post SEQadmin2  
        Started by SEQadmin2, 06-17-2026, 06:09 AM
        0 responses
        50 views
        0 reactions
        Last Post SEQadmin2  
        Started by SEQadmin2, 06-09-2026, 11:58 AM
        0 responses
        110 views
        0 reactions
        Last Post SEQadmin2  
        Working...