I'm new to the NGS field and have just started to analyze my first set of data. I have a single lane of paired-end DNA sequence with 8 samples pooled into one lane. My libraries were generated with the Trueseq kit (as per Illumina's instructions) and run on a HiSeq2000.
I've looked at several chromosomes and the data looks fine. So far so good. However, when I started working with the mtDNA I noticed that for 2 of the samples I have WAY more reads. Instead of ~12 reads per base, I have ~100 across the entire mtDNA.
The most obvious explanation I can think of is PCR bias or some sort of contamination, but all samples were prepared at the same time so I am not sure why only 2/8 of the samples are affected. Also the rest of the genome seems to not have this bias for those 2 samples. Am I missing something? Anyone else have this issue?
I've looked at several chromosomes and the data looks fine. So far so good. However, when I started working with the mtDNA I noticed that for 2 of the samples I have WAY more reads. Instead of ~12 reads per base, I have ~100 across the entire mtDNA.
The most obvious explanation I can think of is PCR bias or some sort of contamination, but all samples were prepared at the same time so I am not sure why only 2/8 of the samples are affected. Also the rest of the genome seems to not have this bias for those 2 samples. Am I missing something? Anyone else have this issue?
Comment