I have recently sequenced a number of PCR amplicons on an Illumina MiSeq, in a sample that must be a compound heterozygote for 2 SNPs (based on phenotype). The 2 SNPs are very close together, so in theory any read that covers both SNPs should carry one of the variants, but never both. However, when I count them manually, only around 80% of reads follow this pattern, while the remaining 20% appear to have both variants, or neither.
The read length was set to 150bp, but quite a few of the reads (in the 20% group that don't appear as expected) are much shorter than that, as short as 40bp (when BAM files are viewed in IGV). Also, in many of the reads from the 20% group, one or both of the SNPs are within about 5 nucleotides of the end of the read.
I have since realised that the short reads are due to very short fragment/insert sizes to begin with (so the two reads of each pair cover the same sequence and overlap 100%).
However, will this have an effect on the reliability of the variant calls?
If not, how do I explain that only 80% of the reads that span both SNPs (around 200 reads) have genotypes concordant with a compound heterozygote?
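For anyone wanting to reproduce the tally, here is a minimal sketch of the count described above. It is not the exact procedure I used; it assumes each read is given as a 0-based reference start plus an ungapped aligned sequence (no indels or soft clips), and the function name `classify_reads`, the SNP tuples, and the 5-nucleotide `min_end_dist` threshold are illustrative only. It classifies every read spanning both SNPs and separately counts reads where either SNP sits close to a read end.

```python
from collections import Counter

def classify_reads(reads, snp1, snp2, min_end_dist=5):
    """Tally haplotype classes for reads that span both SNPs.

    reads: iterable of (start, seq), 0-based reference start, ungapped alignment (assumption).
    snp1, snp2: (ref_pos, ref_base, alt_base) tuples, 0-based positions (hypothetical format).
    Returns (counts, near_end): class counts for all spanning reads, and the subset
    where either SNP lies fewer than min_end_dist bases from a read end.
    """
    p1, ref1, alt1 = snp1
    p2, ref2, alt2 = snp2
    counts = Counter()
    near_end = Counter()
    for start, seq in reads:
        end = start + len(seq)  # exclusive end on the reference
        if not (start <= p1 < end and start <= p2 < end):
            continue  # read does not cover both SNPs
        b1 = seq[p1 - start]
        b2 = seq[p2 - start]
        if b1 not in (ref1, alt1) or b2 not in (ref2, alt2):
            label = "other"  # a base that is neither ref nor alt
        else:
            has1 = b1 == alt1
            has2 = b2 == alt2
            if has1 != has2:
                label = "one_variant"  # expected for a compound heterozygote
            elif has1:
                label = "both"
            else:
                label = "neither"
        counts[label] += 1
        # Distance (in bases) from each SNP to the nearest end of the read
        dist = min(p1 - start, end - 1 - p1, p2 - start, end - 1 - p2)
        if dist < min_end_dist:
            near_end[label] += 1
    return counts, near_end

# Toy data only, not real reads
toy_reads = [
    (0, "TTTTTTTTTTGTTTCTTTTT"),  # spans both SNPs, variant at SNP 1 only
    (8, "TTGTTTTTTT"),            # short read, both variants, SNP 1 near the read start
    (12, "TTCTTTTTTT"),           # does not cover SNP 1, so it is skipped
]
counts, near_end = classify_reads(toy_reads, (10, "A", "G"), (14, "C", "T"))
print(dict(counts), dict(near_end))
```

Comparing `near_end` with `counts` shows whether the discordant classes are enriched for SNPs close to a read end, which is the pattern I think I am seeing in IGV.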