**dkoboldt** · 05-01-2012, 07:17 AM

Hello,

The latest version of VarScan (v2.2.11, just posted) includes a VCF output option for somatic mutations.

This option was already available for multi-sample germline variant calling (mpileup2snp, mpileup2cns, mpileup2indel commands).

Just set --output-vcf to 1.

Yours,

Dan Koboldt

**mark.dunning** · 05-02-2012, 06:03 AM

Can I ask whether the VCF produced by VarScan is valid, though? I used the latest version and tried to annotate the output with ANNOVAR (via its conversion Perl script), but I get an error:

NOTICE: for SNPs, column 6 and beyond MAY BE heterozygosity status, quality score, read depth, RMS mapping quality, quality by depth, if these information can be recognized automatically
NOTICE: for indels, column 6 and beyond MAY BE heterozygosity status, quality score, read depth, read count supporting indel call, RMS mapping quality, if these information can be recognized automatically

Similarly, running vcf-stats from vcftools also gives an error:

Different number of columns at chr1:12198 (expected 10, got 9)
Error not recoverable, exiting.

Here is the head of my VarScan VCF file:

##fileformat=VCFv4.0
##source=VarScan2
##INFO=<ID=DP,Number=1,Type=Integer,Description="Total Depth">
##FILTER=<ID=str10,Description="Less than 10% or more than 90% of variant supporting reads on one strand">
##FORMAT=<ID=GT,Number=1,Type=String,Description="Genotype">
##FORMAT=<ID=GQ,Number=1,Type=Integer,Description="Genotype Quality">
##FORMAT=<ID=DP,Number=1,Type=Integer,Description="Read Depth">
#CHROM POS ID REF ALT QUAL FILTER INFO FORMAT Sample1
chr1 12198 G C . PASS DP=107 GT:GQ:DP 1/1:35:107
chr1 12266 G A . PASS DP=53 GT:GQ:DP 0/1:4:53
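
The column-count mismatch reported by vcf-stats can be confirmed independently with a short check. The following is only a minimal sketch in Python, assuming the file is tab-delimited; the function name `check_vcf_columns` and the in-memory example record (modelled on the first record above) are illustrative, not part of VarScan or vcftools.

```python
import io


def check_vcf_columns(handle):
    """Yield (chrom, pos, expected, got) for data lines whose number of
    tab-separated columns differs from the #CHROM header line."""
    expected = None
    for line in handle:
        line = line.rstrip("\n")
        if not line or line.startswith("##"):
            continue  # skip blank lines and meta-information lines
        fields = line.split("\t")
        if line.startswith("#CHROM"):
            expected = len(fields)  # the header defines the column count
            continue
        if expected is not None and len(fields) != expected:
            pos = fields[1] if len(fields) > 1 else "?"
            yield fields[0], pos, expected, len(fields)


# Hypothetical example: a 10-column header followed by a 9-column record.
example = io.StringIO(
    "##fileformat=VCFv4.0\n"
    "#CHROM\tPOS\tID\tREF\tALT\tQUAL\tFILTER\tINFO\tFORMAT\tSample1\n"
    "chr1\t12198\tG\tC\t.\tPASS\tDP=107\tGT:GQ:DP\t1/1:35:107\n"
)
mismatches = list(check_vcf_columns(example))
for chrom, pos, expected, got in mismatches:
    print(f"{chrom}:{pos} expected {expected} columns, got {got}")
```

To run the same check on a real file, pass an open file handle instead of the `io.StringIO` object.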

Regards,

Mark

**dkrtndhkd** · 05-04-2012, 08:39 PM

What about the somatic option?

I couldn't find the command-line option for VCF output...

**fjrossello** · 05-22-2012, 09:19 PM

Hi dkrtndhkd,

You can also set --output-vcf to 1 for somatic.

Cheers,

Fernando

**oliviajm** · 06-07-2012, 11:41 PM

Hi Mark,

I had a similar problem with another program when I gave it a VCF file produced by VarScan mpileup2indel. It seems that in the VCF files produced by VarScan the QUAL column is empty. So when another tool opens the file, the number of columns is wrong and the data no longer line up with the column names ("PASS" belongs in the "FILTER" column, but here it ends up in the "QUAL" column).
So you need to add a column filled with a dot under the "QUAL" header.
In my case, I used this command:
awk '{ if ($1 ~ "^#") { print $0} else { sub("",".\t",$6); print $1"\t"$2"\t"$3"\t"$4"\t"$5"\t"$6"\t"$7"\t"$8"\t"$9"\t"$10"\t"$11"\t"$12} }' VarScanfile.vcf > outputFile.vcf
and it solved the problem.
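
The same repair can be sketched in Python without hard-coding the number of columns. This is only an illustration under stated assumptions: the file is tab-delimited, the missing field is QUAL (the sixth column, index 5), and only data lines that are exactly one column short of the #CHROM header are touched. The function name `insert_missing_column` and the example record layout are hypothetical.

```python
def insert_missing_column(lines, index=5, placeholder="."):
    """Return VCF lines in which data lines that are one column short of
    the #CHROM header get `placeholder` inserted at position `index`."""
    expected = None
    out = []
    for line in lines:
        line = line.rstrip("\n")
        if line.startswith("#"):
            if line.startswith("#CHROM"):
                expected = len(line.split("\t"))  # header sets the target width
            out.append(line)
            continue
        fields = line.split("\t")
        if expected is not None and len(fields) == expected - 1:
            fields.insert(index, placeholder)  # assumed: QUAL is the missing field
        out.append("\t".join(fields))
    return out


# Hypothetical example: a 10-column header and a record lacking QUAL.
example_lines = [
    "#CHROM\tPOS\tID\tREF\tALT\tQUAL\tFILTER\tINFO\tFORMAT\tSample1",
    "chr1\t12198\t.\tG\tC\tPASS\tDP=107\tGT:GQ:DP\t1/1:35:107",
]
fixed = insert_missing_column(example_lines)
print("\n".join(fixed))
```

Lines that already have the expected number of columns pass through unchanged, so running it twice on the same file should not add a second placeholder.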

Hope it will help you.

Olivia

EDIT: just found this: http://seqanswers.com/forums/showthread.php?t=20000

varscan-annotation pipeline?
