Unconfigured Ad

Collapse
X
 
  • Filter
  • Time
  • Show
Clear All
new posts
  • chrisbala
    Member
    • Jan 2010
    • 82

    Allelic Depth from VCF

    I'm trying to estimate the # of reads representing each allele across multiple samples in RNA-seq data.

    I've identified SNPs with samtools mpileup and vcftools call, but allelic depth is not provided in the vcf.

    I gather that GATK Variant Annotator (-A DepthPerAlleleBySample) might be able to extract this info from the vcf, but GATK seems unhappy with my vcf format

    Code:
    Your input file has a malformed header: unexpected tag count 5 in line <ID=VDB,Number=1,Type=Float,Description="Variant Distance Bias for filtering splice-site artefacts in RNA-seq data (bigger is better)",Version="3">
    Does anyone have other suggests for getting per sample/per allele counts? Or a suggestion to get around this formatting issue?

    Thanks!
  • SylvainL
    Senior Member
    • Feb 2012
    • 180

    #2
    I seem to remember that with bcftools, you can have this information in the bcf file...

    Comment

    • chrisbala
      Member
      • Jan 2010
      • 82

      #3
      OOps, I meant "bcftools call" not vcftools

      Comment

      • SNPsaurus
        Registered Vendor
        • May 2013
        • 525

        #4
        If you pass the DPR option to bcftools call it gives the allele depth (I also do DP for total read depth):
        samtools mpileup -gu -t DP,DPR

        The output is 0/1:153,0,125:61:20,41,0,0
        0/1 allele call
        153,0,125 prob of allele call
        61 read depth
        20,41,0,0 depth of different alleles
        Providing nextRAD genotyping and PacBio sequencing services. http://snpsaurus.com

        Comment

        • chrisbala
          Member
          • Jan 2010
          • 82

          #5
          Allelic Depth

          Excellent! I'll give it a shot!

          Comment

          Latest Articles

          Collapse

          • SEQadmin2
            Nine Things a Sample Prep Scientist Thinks About Before Sequencing
            by SEQadmin2


            I’m not a sequencing expert. I’m a purification scientist who uses NGS to evaluate workflows my group develops. With this perspective, we think about the sample first and the NGS workflow second. The sequencer is an exceptionally honest reporter, but it can only report on what you give it, so whether you get clean, interpretable data from an NGS workflow is largely determined before you begin.

            Here are nine questions we think about, in roughly the order they matter, before...
            06-18-2026, 07:11 AM
          • SEQadmin2
            From Collection to Sequencing: Why Sample Preparation and Preservation Define Sequencing Data
            by SEQadmin2


            Data variability is still an issue in sequencing technologies despite the advances in reproducibility and accuracy of these platforms. But the problem does not originate in the sequencing itself, but in the previous steps, before the sample reaches the sequencer.


            The first step is collection, followed by preservation and sample preparation for analysis. Most scientists overlook those steps, but not being careful might just be skewing the experiment’s results.
            ...
            06-02-2026, 10:05 AM

          ad_right_rmr

          Collapse

          News

          Collapse

          Topics Statistics Last Post
          Started by SEQadmin2, 06-26-2026, 11:10 AM
          0 responses
          10 views
          0 reactions
          Last Post SEQadmin2  
          Started by SEQadmin2, 06-17-2026, 06:09 AM
          0 responses
          45 views
          0 reactions
          Last Post SEQadmin2  
          Started by SEQadmin2, 06-09-2026, 11:58 AM
          0 responses
          105 views
          0 reactions
          Last Post SEQadmin2  
          Started by SEQadmin2, 06-05-2026, 10:09 AM
          0 responses
          125 views
          0 reactions
          Last Post SEQadmin2  
          Working...