Unconfigured Ad

Collapse
X
 
  • Filter
  • Time
  • Show
Clear All
new posts
  • zeam
    Member
    • Oct 2010
    • 43

    Some queries on DNA methylation when processing the BS-seq data

    I saw some papers on DNA methylation using BS-seq,now I meet some questions on DNA methylation when working with the BS-seq data:

    1)first,what is the minimum required threshold of percent methylation at a site to dedine a methylcytosine,I want to use 40%,but I don't know if it was appropriate;

    2)second,somebody mentioned methylation genes in some papers,so how to define a methylated genes,what the criteria is ? Because you know,the raw results of BS-seq is the methylation level at each site,and both of the methylation sites and the C are not even between a gene;

    3)Also,I want to know how to define a differentially methylated region(DMR),this question is in fact the same as my second one because of the uneven methylation cytosine in the chromosomes;and in plant how long (300 bp?) to set the length of DMR in your experiences

    Thanks for your time!
    Best wishes,
  • simonandrews
    Simon Andrews
    • May 2009
    • 870

    #2
    I don't think you're going to find definitive answers to your questions because a lot of this is going to vary depending on the exact system you're working with.

    From the data we've seen the majority of cytosines we measured had a methylation percentage either above 80% or below 20%. Our standard filters are therefore 75% and 25%. However we've also gone back to look at regions with a known methylation state and have found that in those regions even if you apply no filtering at all (so set your cutoff at exactly 50%) the number of incorrect calls we make is still very low (below 5%). The errors also tend not to be clustered so the chance of miscalling a region containing several Cs is even lower.

    In terms of identifying DMRs you can either take a purely statistical approach where you look at the proportion of meth vs unmeth in two samples and see if those two groups are significantly different, or you set a cutoff on the amount of change you want to see and then test only regions which pass the initial filter. We took the second approach, which seemed to work out OK. Ideally you don't want to define a size of DMR since they could be of variable length. We've not seen convincing DMRs which were very short though, so you could set a lower cutoff of a couple of hundred bases to remove noise from your results.

    Comment

    • zeam
      Member
      • Oct 2010
      • 43

      #3
      To simonandrews

      Originally posted by simonandrews View Post
      I don't think you're going to find definitive answers to your questions because a lot of this is going to vary depending on the exact system you're working with.

      From the data we've seen the majority of cytosines we measured had a methylation percentage either above 80% or below 20%. Our standard filters are therefore 75% and 25%. However we've also gone back to look at regions with a known methylation state and have found that in those regions even if you apply no filtering at all (so set your cutoff at exactly 50%) the number of incorrect calls we make is still very low (below 5%). The errors also tend not to be clustered so the chance of miscalling a region containing several Cs is even lower.

      In terms of identifying DMRs you can either take a purely statistical approach where you look at the proportion of meth vs unmeth in two samples and see if those two groups are significantly different, or you set a cutoff on the amount of change you want to see and then test only regions which pass the initial filter. We took the second approach, which seemed to work out OK. Ideally you don't want to define a size of DMR since they could be of variable length. We've not seen convincing DMRs which were very short though, so you could set a lower cutoff of a couple of hundred bases to remove noise from your results.
      Hi,there!
      Thanks for you reply,and I have one question you didn't answer:how to define a methylated gene,or how to define a methylated region.Imaging I have the methylation information (0-1) at each context(CpG,CHG,CHH )for each chromosomes,how to define a methylated region(what the algoritm is) in your experience?
      Thanks again!

      Comment

      • sanamjeet
        Junior Member
        • Mar 2011
        • 1

        #4
        Methylation level/percentage

        I want to ask what does it mean by methylation level ? is it the percentage of the sequence in whole genome methylated ? or something else.

        Comment

        • simonandrews
          Simon Andrews
          • May 2009
          • 870

          #5
          Originally posted by sanamjeet View Post
          I want to ask what does it mean by methylation level ? is it the percentage of the sequence in whole genome methylated ? or something else.
          It will be the percentage of all cytosines which are methylated. In some cases the cytosines will be divided into different contexts (CG CHG CHH etc) and you can quote a methylation level for each context.

          Comment

          Latest Articles

          Collapse

          • GATTACAT
            Reply to Nine Things a Sample Prep Scientist Thinks About Before Sequencing
            by GATTACAT
            Love this - good data definitely starts from good input, and poor input can only give relatively poor data. I particularly like the mention of Nanodrop/absorbance based methods for quantification. It's such a toss up if you'll get an accurate reading or what amounts to a randomly generated number, and a lot of library/sequencing related issues can be traced back to poor quant.
            Today, 11:43 AM
          • SEQadmin2
            Nine Things a Sample Prep Scientist Thinks About Before Sequencing
            by SEQadmin2


            I’m not a sequencing expert. I’m a purification scientist who uses NGS to evaluate workflows my group develops. With this perspective, we think about the sample first and the NGS workflow second. The sequencer is an exceptionally honest reporter, but it can only report on what you give it, so whether you get clean, interpretable data from an NGS workflow is largely determined before you begin.

            Here are nine questions we think about, in roughly the order they matter, before...
            06-18-2026, 07:11 AM
          • SEQadmin2
            From Collection to Sequencing: Why Sample Preparation and Preservation Define Sequencing Data
            by SEQadmin2


            Data variability is still an issue in sequencing technologies despite the advances in reproducibility and accuracy of these platforms. But the problem does not originate in the sequencing itself, but in the previous steps, before the sample reaches the sequencer.


            The first step is collection, followed by preservation and sample preparation for analysis. Most scientists overlook those steps, but not being careful might just be skewing the experiment’s results.
            ...
            06-02-2026, 10:05 AM

          ad_right_rmr

          Collapse

          News

          Collapse

          Topics Statistics Last Post
          Started by SEQadmin2, Yesterday, 05:37 AM
          0 responses
          8 views
          0 reactions
          Last Post SEQadmin2  
          Started by SEQadmin2, 06-26-2026, 11:10 AM
          0 responses
          18 views
          0 reactions
          Last Post SEQadmin2  
          Started by SEQadmin2, 06-17-2026, 06:09 AM
          0 responses
          52 views
          0 reactions
          Last Post SEQadmin2  
          Started by SEQadmin2, 06-09-2026, 11:58 AM
          0 responses
          110 views
          0 reactions
          Last Post SEQadmin2  
          Working...