Unconfigured Ad

**simonandrews** · 02-21-2011, 12:24 AM

I don't think you're going to find definitive answers to your questions because a lot of this is going to vary depending on the exact system you're working with.

From the data we've seen the majority of cytosines we measured had a methylation percentage either above 80% or below 20%. Our standard filters are therefore 75% and 25%. However we've also gone back to look at regions with a known methylation state and have found that in those regions even if you apply no filtering at all (so set your cutoff at exactly 50%) the number of incorrect calls we make is still very low (below 5%). The errors also tend not to be clustered so the chance of miscalling a region containing several Cs is even lower.

In terms of identifying DMRs you can either take a purely statistical approach where you look at the proportion of meth vs unmeth in two samples and see if those two groups are significantly different, or you set a cutoff on the amount of change you want to see and then test only regions which pass the initial filter. We took the second approach, which seemed to work out OK. Ideally you don't want to define a size of DMR since they could be of variable length. We've not seen convincing DMRs which were very short though, so you could set a lower cutoff of a couple of hundred bases to remove noise from your results.

**zeam** · 02-22-2011, 05:42 PM

To simonandrews

Originally posted by simonandrews View Post

I don't think you're going to find definitive answers to your questions because a lot of this is going to vary depending on the exact system you're working with.

From the data we've seen the majority of cytosines we measured had a methylation percentage either above 80% or below 20%. Our standard filters are therefore 75% and 25%. However we've also gone back to look at regions with a known methylation state and have found that in those regions even if you apply no filtering at all (so set your cutoff at exactly 50%) the number of incorrect calls we make is still very low (below 5%). The errors also tend not to be clustered so the chance of miscalling a region containing several Cs is even lower.

In terms of identifying DMRs you can either take a purely statistical approach where you look at the proportion of meth vs unmeth in two samples and see if those two groups are significantly different, or you set a cutoff on the amount of change you want to see and then test only regions which pass the initial filter. We took the second approach, which seemed to work out OK. Ideally you don't want to define a size of DMR since they could be of variable length. We've not seen convincing DMRs which were very short though, so you could set a lower cutoff of a couple of hundred bases to remove noise from your results.

Hi,there!
Thanks for you reply,and I have one question you didn't answer:how to define a methylated gene,or how to define a methylated region.Imaging I have the methylation information (0-1) at each context(CpG,CHG,CHH )for each chromosomes,how to define a methylated region(what the algoritm is) in your experience?
Thanks again!

**sanamjeet** · 03-17-2011, 07:30 AM

Methylation level/percentage

I want to ask what does it mean by methylation level ? is it the percentage of the sequence in whole genome methylated ? or something else.

**simonandrews** · 03-17-2011, 07:40 AM

Originally posted by sanamjeet View Post

I want to ask what does it mean by methylation level ? is it the percentage of the sequence in whole genome methylated ? or something else.

It will be the percentage of all cytosines which are methylated. In some cases the cytosines will be divided into different contexts (CG CHG CHH etc) and you can quote a methylation level for each context.

Topics	Statistics	Last Post
New AI Model Captures Long-Range Genomic Signals to Improve RNA Splice Site Prediction by SEQadmin2 Started by SEQadmin2, Yesterday, 05:37 AM	0 responses 8 views 0 reactions	Last Post by SEQadmin2 Yesterday, 05:37 AM
Large-Scale Protein Screen Uncovers Hidden Regulators of Alternative Polyadenylation by SEQadmin2 Started by SEQadmin2, 06-26-2026, 11:10 AM	0 responses 18 views 0 reactions	Last Post by SEQadmin2 06-26-2026, 11:10 AM
Whole-Genome Sequencing Traces Faroe Islands Ancestry to a North Atlantic Founder Population by SEQadmin2 Started by SEQadmin2, 06-17-2026, 06:09 AM	0 responses 52 views 0 reactions	Last Post by SEQadmin2 06-17-2026, 06:09 AM
Sequencing the Two-Toed Sloth Genome Reveals Jumping Genes Tied to Its Extreme Metabolism by SEQadmin2 Started by SEQadmin2, 06-09-2026, 11:58 AM	0 responses 110 views 0 reactions	Last Post by SEQadmin2 06-09-2026, 11:58 AM

Unconfigured Ad

Some queries on DNA methylation when processing the BS-seq data

Comment

Comment

Comment

Comment

Latest Articles

ad_right_rmr

News