Hi all, this is my first post, so apologies if it's inappropriate. It's a pretty simple question, but I need a little help. I promise I've googled far and wide trying to figure it out myself.
I use short reads from a MiSeq to make clinical variant calls with GATK, across various panels (TruSight Cancer etc.). Some exons/genes always have low coverage (due to GC content etc.), and others fail in just one sample, which is often clinically relevant.
I would like to compare the mean coverage of each exon/gene in each sample against a 'gold standard' derived from what my lab scientists tell me is a 'good run'. Currently I do a t-test comparing the gold-standard mean against the read depths at each base of the exon/gene I'm calling variants on. Basically, I only want to flag a low mean read depth when it is significantly different from the gold-standard mean.
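In case it helps, here is roughly what that looks like (a minimal sketch with made-up depth values; in reality the per-base depths come from my coverage files, and the 0.05 cutoff is just a placeholder):

```python
# Sketch of the current approach: one-sample t-test of per-base read
# depths in one exon against the gold-standard mean for that exon.
from scipy import stats

gold_mean = 412.0  # made-up: mean coverage for this exon in the "good run"
sample_depths = [310, 295, 330, 305, 320, 315, 300, 325]  # made-up per-base depths

# One-sided test: only flag coverage that is significantly LOW
# (alternative= requires scipy >= 1.6).
result = stats.ttest_1samp(sample_depths, gold_mean, alternative="less")
if result.pvalue < 0.05:
    print("flag: exon coverage significantly below gold standard")
```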
It made sense at first because I'm comparing two means. Is that right? It seems wrong because I'm really only comparing two samples, so I thought I should do a Z-test instead...
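The Z-test version I had in mind would be something like the sketch below (all numbers made up, and it assumes I have exon means from several gold runs to estimate a spread, which a single 'good run' doesn't actually give me):

```python
# Sketch of the Z-test idea: score this sample's mean exon coverage
# against the distribution of gold-run means for the same exon.
from statistics import mean, stdev, NormalDist

gold_exon_means = [400.0, 425.0, 390.0, 410.0, 415.0]  # made-up: per-run gold means
sample_mean = 250.0                                     # made-up: this sample's exon mean

z = (sample_mean - mean(gold_exon_means)) / stdev(gold_exon_means)
p_low = NormalDist().cdf(z)  # one-sided: probability of coverage this low or lower
if p_low < 0.05:
    print("flag: exon coverage significantly below gold standard")
```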
Has anyone done anything similar? How did you implement it?