Hi all, I have a VCF file which contains a raw list of mutations/snps for my study. I would like to exclude known SNPs from dbSNP131/hg19 (which I also have in VCF format).
I was thinking about BEDTools, something like
but AFAIK this only exclude those variations that happen at the same position. That means that if I have a novel variation at the same position, that would be excluded too.
Does anybody have an idea/processing pipeline to deal with this? I was looking at vcftools but couldn't find anything helpful
d
I was thinking about BEDTools, something like
Code:
intersectBed -a MyList.vcf -b hg19.snp131.vcf.gz -v > specific.vcf
Does anybody have an idea/processing pipeline to deal with this? I was looking at vcftools but couldn't find anything helpful
d
Comment