Hi
While using Varscan (V.2.3.1); I have used default options but get different results (for the same dataset) using indel.vcf filters (on snp.vcf) and simply using indelvcf filter (on snp dataset). In theory, this is just a file format variation and the stats should remain the same. However I see a difference due to p-value.
time java -jar ~/tools/VarScan.v2.3.1.jar somaticFilter 65_varscan-out.snp -min-var-freq 0.5 --indel-file 65_varscan-out.indel --output-file 65-filtered
Window size: 10
Window SNPs: 3
Indel margin: 3
Reading input from 65_varscan-out.snp
2962 cluster SNPs identified
Reading input from 65_varscan-out.snp
88168 variants in input stream
13612 failed to meet coverage requirement
5579 failed to meet reads2 requirement
24230 failed to meet varfreq requirement
40748 failed to meet p-value requirement
45 in SNP clusters were removed
1 were removed near indels
3953 passed filters
real 0m3.532s
user 0m2.268s
sys 0m0.188s
time java -jar ~/tools/VarScan.v2.3.1.jar somaticFilter 65_varscan-out.snp.vcf -min-var-freq 0.5 --indel-file 65_varscan-out.indel.vcf --output-file 65-filtered.vcf
Window size: 10
Window SNPs: 3
Indel margin: 3
Reading input from 65_varscan-out.snp.vcf
2972 cluster SNPs identified
Reading input from 65_varscan-out.snp.vcf
88177 variants in input stream
13615 failed to meet coverage requirement
5583 failed to meet reads2 requirement
24231 failed to meet varfreq requirement
2862 failed to meet p-value requirement
367 in SNP clusters were removed
39 were removed near indels
41480 passed filters
real 0m1.506s
user 0m2.504s
sys 0m0.272s
Any particular reason for these differences ? Please note that on a quick comparison between *.vcf files and its corresponding snp and indel files, there are no differences when compared by its chr and position.
Many Thanks
While using Varscan (V.2.3.1); I have used default options but get different results (for the same dataset) using indel.vcf filters (on snp.vcf) and simply using indelvcf filter (on snp dataset). In theory, this is just a file format variation and the stats should remain the same. However I see a difference due to p-value.
time java -jar ~/tools/VarScan.v2.3.1.jar somaticFilter 65_varscan-out.snp -min-var-freq 0.5 --indel-file 65_varscan-out.indel --output-file 65-filtered
Window size: 10
Window SNPs: 3
Indel margin: 3
Reading input from 65_varscan-out.snp
2962 cluster SNPs identified
Reading input from 65_varscan-out.snp
88168 variants in input stream
13612 failed to meet coverage requirement
5579 failed to meet reads2 requirement
24230 failed to meet varfreq requirement
40748 failed to meet p-value requirement
45 in SNP clusters were removed
1 were removed near indels
3953 passed filters
real 0m3.532s
user 0m2.268s
sys 0m0.188s
time java -jar ~/tools/VarScan.v2.3.1.jar somaticFilter 65_varscan-out.snp.vcf -min-var-freq 0.5 --indel-file 65_varscan-out.indel.vcf --output-file 65-filtered.vcf
Window size: 10
Window SNPs: 3
Indel margin: 3
Reading input from 65_varscan-out.snp.vcf
2972 cluster SNPs identified
Reading input from 65_varscan-out.snp.vcf
88177 variants in input stream
13615 failed to meet coverage requirement
5583 failed to meet reads2 requirement
24231 failed to meet varfreq requirement
2862 failed to meet p-value requirement
367 in SNP clusters were removed
39 were removed near indels
41480 passed filters
real 0m1.506s
user 0m2.504s
sys 0m0.272s
Any particular reason for these differences ? Please note that on a quick comparison between *.vcf files and its corresponding snp and indel files, there are no differences when compared by its chr and position.
Many Thanks
Comment