Hello,
I noticed yesterday the new release of VarScan2. Of course, I tried it
on two samples, by changing only the version.
I wonder what are the changes. I read "VarScan v2.3.1 released with bug fixes", but which ones?
I read also the change in VCF compatibility. For my problematic, I do not use VCF format, I'm rather interested in the bug fixes.
I will show you the differences that I got between 2 versions on 2 samples:
When I saw that the number of positions with sufficient coverage decreased, I though that the bug mentioned here: seqanswers.com/forums/showthread.php?t=20791 (coverage criteria not satisfied) was solved. Unfortunately, it doesn't seem to be the case: with the new version, I have such results:
Then, I wanted to see if the strand filter was activated, whatever the parameter value, in this new version. As you can see, I didn't ask for the strand filter because I don't want it ; nevertheless, 11041 positions were removed by the strand filter.
Finally, there is still this issue: tumor_reads1 different from tumor_reads1_plus+tumor_reads1_minus for the INDELs
My questions are:
Thank you in advance for your help and thank you Dan Koboldt for maintaining your tool, which has plenty of advantages, even if I mentioned here issues only
Jane
I noticed yesterday the new release of VarScan2. Of course, I tried it

I wonder what are the changes. I read "VarScan v2.3.1 released with bug fixes", but which ones?
I read also the change in VCF compatibility. For my problematic, I do not use VCF format, I'm rather interested in the bug fixes.
I will show you the differences that I got between 2 versions on 2 samples:
java -Xmx15g -jar VarScan.v2.3.1.jar somatic /data/cd3.mpileup /data/159cd14.mpileup --output-snp /data/VarScan2/varscan.snp --output-indel /data/VarScan2/varscan.indel --min-coverage 10 --min-var-freq 0.2 --min-freq-for-hom 0.75 --normal-purity 0.8 --tumor-purity 1 --p-value 0.05 --somatic-p-value 0.05 --strand-filter 0 --min-avg-qual 20 --min-strands2 0 --min-reads2 0
134948529 positions in tumor
134724809 positions shared in normal
87521347 had sufficient coverage for comparison
87408822 were called Reference
178 were mixed SNP-indel calls and filtered
11041 were removed by the strand filter
84099 were called Germline
8338 were called LOH
8284 were called Somatic
585 were called Unknown
0 were called Variant
134948529 positions in tumor
134724809 positions shared in normal
87521347 had sufficient coverage for comparison
87408822 were called Reference
178 were mixed SNP-indel calls and filtered
11041 were removed by the strand filter
84099 were called Germline
8338 were called LOH
8284 were called Somatic
585 were called Unknown
0 were called Variant
java -Xmx15g -jar VarScan.v2.2.10.jar somatic /data/cd3.mpileup /data/159cd14.mpileup --output-snp /data/VarScan2/varscan.snp --output-indel /data/VarScan2/varscan.indel --min-coverage 10 --min-var-freq 0.2 --min-freq-for-hom 0.75 --normal-purity 0.8 --tumor-purity 1 --p-value 0.05 --somatic-p-value 0.05 --strand-filter 0 --min-avg-qual 20 --min-strands2 0 --min-reads2 0
134948529 positions in tumor
134724809 positions shared in normal
87521679 had sufficient coverage for comparison
87411857 were called Reference
117 were mixed SNP-indel calls and filtered
10460 were removed by the strand filter
82914 were called Germline
7907 were called LOH
8066 were called Somatic
358 were called Unknown
0 were called Variant
134948529 positions in tumor
134724809 positions shared in normal
87521679 had sufficient coverage for comparison
87411857 were called Reference
117 were mixed SNP-indel calls and filtered
10460 were removed by the strand filter
82914 were called Germline
7907 were called LOH
8066 were called Somatic
358 were called Unknown
0 were called Variant
chr1 24334459 A C 6 0 0% A 3 1 25% M Somatic 1.0 0.39999999999999963 0 3 0 1
Finally, there is still this issue: tumor_reads1 different from tumor_reads1_plus+tumor_reads1_minus for the INDELs
chrom position ref var normal_reads1 normal_reads2 normal_var_freq normal_gt tumor_reads1 tumor_reads2 tumor_var_freq tumor_gt somatic_status variant_p_value somatic_p_value tumor_reads1_plus tumor_reads1_minus tumor_reads2_plus tumor_reads2_minus
chr1 3801133 A -T 12 0 0% A 8 3 27,27% */-T Somatic 1.0 0.09316770186335391 0 11 0 3
chr1 3801133 A -T 12 0 0% A 8 3 27,27% */-T Somatic 1.0 0.09316770186335391 0 11 0 3
- Which bugs have been fixed in this new release?
- Do you know a way to avoid the problems I meet? (especially concerning the strand filter, it's a big problem for me - it's easy to handle the coverage criteria)
- For Dan Koboldt: do you intend to solve some of these issues? Or do I do something wrong for obtaining such results?
Thank you in advance for your help and thank you Dan Koboldt for maintaining your tool, which has plenty of advantages, even if I mentioned here issues only

Jane
Comment