Unconfigured Ad

Collapse
X
 
  • Filter
  • Time
  • Show
Clear All
new posts
  • sasignor
    Member
    • Sep 2009
    • 28

    Software for variant frequency

    Hello all,

    I used to be able to use CLC for my SNP calling, which generates the reference base and the frequency of the variant allele in separate columns. I no longer have access to this program and have been using bwa-samtools. I have all of my files in bam format and have been generating SAM pileup tables using the -vcf option. The problem is it gives you the variant allele and consensus allele (not reference allele) and the phred scaled quality of the SNP, not its frequency. Is there a better software to use in order to get this information? Or is there a way to get this information from samtools?
    sasignor is online now Report Post Edit/Delete Message
  • nilshomer
    Nils Homer
    • Nov 2008
    • 1283

    #2
    Samtools pileup gives you the reference allele, consensus call, and all bases (with strandedness) among other things.

    I suggest you take a look at these pages:

    Comment

    • sasignor
      Member
      • Sep 2009
      • 28

      #3
      Ah, yes, the part about the reference allele was a typo. What I mean is I was looking for them all to be generated in separate columns - the variant, and the reference, not as part of a the read bases, as well as the frequency of the alleles as opposed to the SNP quality scores. I think something like VarScan is what I was looking for, although I am still interested as to what other people use to generate this type of output.

      Comment

      • nilshomer
        Nils Homer
        • Nov 2008
        • 1283

        #4
        Originally posted by sasignor View Post
        Ah, yes, the part about the reference allele was a typo. What I mean is I was looking for them all to be generated in separate columns - the variant, and the reference, not as part of a the read bases, as well as the frequency of the alleles as opposed to the SNP quality scores. I think something like VarScan is what I was looking for, although I am still interested as to what other people use to generate this type of output.

        http://varscan.sourceforge.net/using-varscan.html
        A few lines of code using the read bases column conditioned on the variant call and reference allele would do it. You have all the information.

        Comment

        • kmcarr
          Senior Member
          • May 2008
          • 1181

          #5
          Here is a script I wrote for doing this. Pass the script the name of your pileup file (created with -vcf and any other filtering) and it will output the following (as tab delimited columns):

          1. Chromosome
          2. Position
          3. SNP (Ref > Alt base(s))
          4. Consensus quality
          5. Read depth
          6. Alt base fraction (Alt base/Read depth)
          7. Alt base %
          Attached Files

          Comment

          Latest Articles

          Collapse

          • SEQadmin2
            Nine Things a Sample Prep Scientist Thinks About Before Sequencing
            by SEQadmin2


            I’m not a sequencing expert. I’m a purification scientist who uses NGS to evaluate workflows my group develops. With this perspective, we think about the sample first and the NGS workflow second. The sequencer is an exceptionally honest reporter, but it can only report on what you give it, so whether you get clean, interpretable data from an NGS workflow is largely determined before you begin.

            Here are nine questions we think about, in roughly the order they matter, before...
            06-18-2026, 07:11 AM
          • SEQadmin2
            From Collection to Sequencing: Why Sample Preparation and Preservation Define Sequencing Data
            by SEQadmin2


            Data variability is still an issue in sequencing technologies despite the advances in reproducibility and accuracy of these platforms. But the problem does not originate in the sequencing itself, but in the previous steps, before the sample reaches the sequencer.


            The first step is collection, followed by preservation and sample preparation for analysis. Most scientists overlook those steps, but not being careful might just be skewing the experiment’s results.
            ...
            06-02-2026, 10:05 AM

          ad_right_rmr

          Collapse

          News

          Collapse

          Topics Statistics Last Post
          Started by SEQadmin2, Today, 11:10 AM
          0 responses
          6 views
          0 reactions
          Last Post SEQadmin2  
          Started by SEQadmin2, 06-17-2026, 06:09 AM
          0 responses
          41 views
          0 reactions
          Last Post SEQadmin2  
          Started by SEQadmin2, 06-09-2026, 11:58 AM
          0 responses
          102 views
          0 reactions
          Last Post SEQadmin2  
          Started by SEQadmin2, 06-05-2026, 10:09 AM
          0 responses
          123 views
          0 reactions
          Last Post SEQadmin2  
          Working...