Is there a method out there that would allow me to determine whether a particular sequence motif (DNA) has adequate sequencing depth of coverage across a genome? What I am interested in seeing is whether or not any specific sequence motifs are associated with decreased levels of sequencing coverage across the genome. I have illumina miseq data, and can see one of two ways of doing this. 1) I identify regions with low coverage, identify motifs within these regions that are common between all or most of these regions, then compare them to random other equally sized regions in the genome. Or 2) I randomly generate kmers across the genome, and then evaluate the sequencing coverage across these kmers and determine if any have significantly lower values than the others. I'm wondering if anyone has done this before, and whether there might be a tool that has been created to do this?
Thanks
Thanks
Comment