A newbie question:

How do I calculate the probability that reads from a random set of sequences (of a specified length, say short reads of 25 bp) align within a window of a set length (say 10 kb)? Essentially, I'd like to know the **sequence coverage probability along a specified length of DNA**.

I'd like to use this probability to test whether what I see (for example, 3 reads within a particular 10 kb window) is truly significant or could have aligned there by random chance.
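To make the question concrete, here is a rough sketch of the kind of calculation I have in mind. It assumes reads land uniformly at random across the genome, so the number of reads starting in a window is approximately Poisson distributed. The function names, the genome size, and the total read count are made-up values for illustration only, and the sketch ignores read length, mappability, and multiple testing across windows.

```python
import math

def poisson_tail(k, lam):
    """Return P(X >= k) for X ~ Poisson(lam)."""
    # Sum the probabilities of 0..k-1 events, then take the complement.
    cdf = sum(math.exp(-lam) * lam**i / math.factorial(i) for i in range(k))
    return 1.0 - cdf

def window_p_value(observed, total_reads, genome_size, window_size):
    """Chance of seeing at least `observed` reads in one window
    if reads were placed uniformly at random (an assumption)."""
    # Expected number of reads starting inside a single window.
    lam = total_reads * window_size / genome_size
    return poisson_tail(observed, lam)

# Hypothetical numbers: 100,000 mapped reads, 3 Gb genome, 10 kb window.
p = window_p_value(observed=3, total_reads=100_000,
                   genome_size=3_000_000_000, window_size=10_000)
print(f"P(>= 3 reads in a 10 kb window by chance) = {p:.4g}")
```

If something like this is reasonable, I assume the per-window value would still need correcting for the number of windows tested.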

Please let me know your thoughts and whether this is a valid question to ask in the first place.

Thanks!

