Unconfigured Ad

Collapse
X
 
  • Filter
  • Time
  • Show
Clear All
new posts
  • lchong
    Junior Member
    • Feb 2012
    • 2

    VCF 'QUAL' tool

    I'm working on generating some quality statistics for various BAM files. One number I'd like to generate is the confidence of the base call for each base--essentially the QUAL column of the VCF format spec (http://www.1000genomes.org/wiki/Anal...mat-version-41). However, I don't want to generate an entire VCF file, just a simple tab-delimited file that shows chromosome, position, and genotype confidence score.

    I've considered doing the calculation by hand, but I'd like to know if there is some existing tool/function that can accomplish this task for me. Again, I'm not interested in outputting any other data such as the actual base calls--just the confidence scores.

    Thanks for your help!
  • Richard Barker
    Member
    • Apr 2012
    • 47

    #2
    I'm also to trying to generate a VCF (to use to generate counts per gene with some Arabidopsis RNAseq data i have) file for Arabidopsis thaliana but am not sure where to start... any advice

    Comment

    • zgtmann
      Member
      • Feb 2013
      • 13

      #3
      Hi guys,

      just use GATK to generate tab from your vfc file.

      MAKING TAB-DELIMITED FILE FROM VCF BY GATK

      java -jar GenomeAnalysisTK.jar \
      -R reference.fasta
      -T VariantsToTable \
      -V file.vcf \
      -F CHROM -F POS -F ID -F QUAL -F AC \ % what do you want
      -o results.table

      Comment

      • aeonsim
        Member
        • Jun 2011
        • 46

        #4
        Just remember QUAL score will be confounded by Copy Number and SVs present in your individual/population. You'll get some very high QUAL scores for sites in the genome that have higher than expected coverage as lots of reads will appear to support a Variant at that site when actually it should be multiple sites. If you are going to be working with QUAL make sure you apply a Depth of Coverage filter to discard sites with depth of Coverage greater than ~1.2x the average depth of coverage for the sample.

        Comment

        Latest Articles

        Collapse

        • GATTACAT
          Reply to Nine Things a Sample Prep Scientist Thinks About Before Sequencing
          by GATTACAT
          Love this - good data definitely starts from good input, and poor input can only give relatively poor data. I particularly like the mention of Nanodrop/absorbance based methods for quantification. It's such a toss up if you'll get an accurate reading or what amounts to a randomly generated number, and a lot of library/sequencing related issues can be traced back to poor quant.
          07-01-2026, 11:43 AM
        • SEQadmin2
          Nine Things a Sample Prep Scientist Thinks About Before Sequencing
          by SEQadmin2


          I’m not a sequencing expert. I’m a purification scientist who uses NGS to evaluate workflows my group develops. With this perspective, we think about the sample first and the NGS workflow second. The sequencer is an exceptionally honest reporter, but it can only report on what you give it, so whether you get clean, interpretable data from an NGS workflow is largely determined before you begin.

          Here are nine questions we think about, in roughly the order they matter, before...
          06-18-2026, 07:11 AM

        ad_right_rmr

        Collapse

        News

        Collapse

        Topics Statistics Last Post
        Started by SEQadmin2, 07-02-2026, 11:08 AM
        0 responses
        7 views
        0 reactions
        Last Post SEQadmin2  
        Started by SEQadmin2, 06-30-2026, 05:37 AM
        0 responses
        12 views
        0 reactions
        Last Post SEQadmin2  
        Started by SEQadmin2, 06-26-2026, 11:10 AM
        0 responses
        20 views
        0 reactions
        Last Post SEQadmin2  
        Started by SEQadmin2, 06-17-2026, 06:09 AM
        0 responses
        54 views
        0 reactions
        Last Post SEQadmin2  
        Working...