Hi all-- I am attempting to filter out false "SNPs" in my bowtie assemblies (yeast genomic DNA). One major predictor of SNP falseness is being located in a regions that shows sequence similarity with somewhere else in the genome (e.g., paralogs, telomeres).
I've attempted to limit the read mismapping that underlies these SNPs by tweaking the bowtie parameters, but I still get quite a few. So an alternative would be to ask, for each region containing a putative SNP, what is the local uniqueness?
Does anyone know of a previously generated track that catalogs this property? Or better still, a tool that would perform a sliding window analysis (probably by parsing BLAST or BLAT) to record it for any given sequence?
Thanks!
I've attempted to limit the read mismapping that underlies these SNPs by tweaking the bowtie parameters, but I still get quite a few. So an alternative would be to ask, for each region containing a putative SNP, what is the local uniqueness?
Does anyone know of a previously generated track that catalogs this property? Or better still, a tool that would perform a sliding window analysis (probably by parsing BLAST or BLAT) to record it for any given sequence?
Thanks!
Comment