**me_myself_andI** · 05-14-2013, 04:36 AM

Shameless plug: one option is simply not to filter on mapping quality, and instead to use a low-frequency variant caller that builds mapping quality into its model, e.g. LoFreq.

Andreas
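To illustrate what "building mapping quality into the model" can mean, here is a minimal sketch that converts Phred-scaled base and mapping qualities into error probabilities and combines them. It is not LoFreq's code and does not claim to reproduce its model; the function names and the independence assumption are illustrative only.

```python
def phred_to_prob(q):
    """Convert a Phred-scaled quality score to an error probability."""
    return 10 ** (-q / 10)


def joint_error_prob(base_q, map_q):
    """Probability that a base is wrong because of a sequencing error,
    a mapping error, or both.

    Assumption (for illustration only): the two error sources are independent.
    """
    p_base = phred_to_prob(base_q)
    p_map = phred_to_prob(map_q)
    return 1 - (1 - p_base) * (1 - p_map)


if __name__ == "__main__":
    # A Q30 base in a read with mapping quality 20: the mapping error dominates.
    print(joint_error_prob(30, 20))
```

Under this view a poorly mapped read is not thrown away; it simply contributes less evidence, because its per-base error probability is higher.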

**JackieBadger** · 05-14-2013, 05:22 AM

http://genomebiology.com/content/13/5/R34

**JackieBadger** · 05-14-2013, 05:28 AM

I would be very cautious about calling a variant at 0.09% of the total depth of a sample. In my experience, PCR-carryover contamination and repeatable, sequence-specific base mismatch errors can appear at a higher percentage than this.
If something is rare in your sample but real, you should still see it at considerably higher depth than errors. NGS can quantify the copy number of alleles, but getting 3 out of 3155 reads would lead me to believe that it is not "real".
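The arithmetic behind that worry can be sketched as follows. The read counts come from the thread; the per-base error rates are assumed values chosen only for illustration, not measurements from this dataset.

```python
def allele_frequency(alt_reads, depth):
    """Fraction of reads at a position that support the variant."""
    return alt_reads / depth


def expected_error_reads(depth, error_rate):
    """Number of reads expected to show a given mismatch by error alone."""
    return depth * error_rate


if __name__ == "__main__":
    depth, alt_reads = 3155, 3  # counts quoted in the thread
    print(f"observed frequency: {allele_frequency(alt_reads, depth):.4%}")

    # Hypothetical per-base error rates, for illustration only.
    for rate in (0.0005, 0.001, 0.005):
        print(f"error rate {rate:.2%}: "
              f"~{expected_error_reads(depth, rate):.1f} error reads expected")
```

If the true error rate at the position is anywhere near 0.1%, about three mismatching reads are expected from error alone, which is why three supporting reads carry little weight.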

**nieder** · 05-15-2013, 03:09 AM

Thank you both for your help. JackieBadger's thought about 3/3155 reads not being 'real' was indeed my worry, since it's such a low frequency. We are looking for any evidence that some rare parasitic gene variants exist in a specific local population of field isolates, and were thrilled to find a few of them in some of the samples. But then when I started doing some basic filtering, they disappeared, and so I was hoping there was some method to provide some level of statistical confirmation.

Running LoFreq on the BAM files also showed that the variants we had found were not real. Thankfully, this particular case has a happy ending regardless of which way it turned out, since either answer provides solid information towards explaining the possible spread of this variant across the endemic area.

**JackieBadger** · 05-15-2013, 03:41 AM

So, parasitic gene variants... would it not be easier to barcode individuals? Is that possible, or are you doing some metagenomic/pooling-type analysis?
Even if a variant is rare in a population, it still represents an allelic variant that should be seen at a decent depth.
Have you confirmed the variant using cloning?

**nieder** · 05-15-2013, 05:17 AM

Each individual human from which parasite samples were collected is individually barcoded. Unfortunately, the parasitic DNA for each individual is a pooled sample of all the parasitic eggs collected from that person (pooling is a necessity because of the way eggs are harvested), and the infection is known not to be clonal. One of the things we are trying to find out is whether the known lab variants are actually found in the wild. We were thrilled to find them in some of our samples, but their low frequency and low quality seem to indicate that they might not be real. We're trying to see if there's a way to take the variant rates at other positions of the gene and draw conclusions about the variant rate at our positions of interest.
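One way to frame that idea, as a rough sketch only: pool the non-reference counts at other positions of the gene into a background error rate, then ask how surprising the count at the position of interest is under a binomial model. The background counts below are invented for illustration, and the model assumes every position shares the same error rate, which sequence-specific errors would violate.

```python
from math import comb


def background_rate(counts):
    """Pooled non-reference rate from (alt_reads, depth) pairs at other positions."""
    total_alt = sum(alt for alt, _ in counts)
    total_depth = sum(depth for _, depth in counts)
    return total_alt / total_depth


def binom_upper_tail(k, n, p):
    """P(X >= k) for X ~ Binomial(n, p): chance of k or more error reads."""
    below = sum(comb(n, i) * p**i * (1 - p) ** (n - i) for i in range(k))
    return max(0.0, 1.0 - below)


if __name__ == "__main__":
    # Hypothetical (alt_reads, depth) at other positions; not data from the thread.
    other_positions = [(2, 3000), (4, 3200), (3, 3100)]
    rate = background_rate(other_positions)

    # The position of interest from the thread: 3 variant reads out of 3155.
    p_value = binom_upper_tail(3, 3155, rate)
    print(f"background rate {rate:.4%}, P(>= 3 error reads) = {p_value:.3f}")
```

A large tail probability means the observed count is what error alone would produce. If many positions are tested this way, the threshold should be corrected for multiple testing.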

Finally, at this point, we don't have the ability and/or capacity to do any cloning of our samples.

**JackieBadger** · 05-15-2013, 05:34 AM

Are you preparing/sequencing your lab samples and wild samples in the same room, with the same pipettes etc., or on the same sequencer?
I would say that you are seeing low-level PCR carryover.

I target the same amplicon in individuals, and then pool into the same library. Each amplicon may contain, say, 8 alleles. When we pool 1000 individuals, we still see rare alleles, i.e. those present in just one individual, at significant depths.

What makes you think these rare variants are real anyway? Do you sequence them at high numbers in your lab samples? If they then turn up at low copy number in the wild, I would say it's low-level contamination. There was also a recent thread discussing the rate of contamination between MiSeq runs, based on carryover within the machine.


low frequency variants vs mapping quality
