  • maf5
    Junior Member
    • Jun 2016
    • 2

    Binary methylation states after bisulfite sequencing

    Hi everyone. I have sequencing data (Illumina HiSeq) from bisulfite-treated samples (Arabidopsis). The bisulfite conversion of some selected regions was tested by Sanger sequencing initially and looked fine. The library was then made from the same samples using the Diagenode MicroPlex kit.

    Here's the problem: almost all of the cytosines show either total or zero methylation. Very few (<1%) show any intermediate level of methylation, even in the regions that were already tested by Sanger sequencing, which showed a complete range of methylation levels from 0 to 100%.
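One quick check worth running on a result like this is whether the binary calls track read depth, since a cytosine covered by a single read can only ever score 0% or 100%. The following is a minimal sketch, assuming per-cytosine counts are available as (methylated, unmethylated) pairs; the function names and input layout are hypothetical and not taken from any tool mentioned in this thread. It is a way to rule low coverage in or out, not a diagnosis of this dataset.

```python
from collections import defaultdict


def methylation_level(meth, unmeth):
    """Fraction of reads reporting a methylated C at one cytosine."""
    total = meth + unmeth
    if total == 0:
        raise ValueError("cytosine has no coverage")
    return meth / total


def binary_fraction_by_coverage(sites):
    """For each coverage depth, return the fraction of cytosines whose
    methylation level is exactly 0 or 1.

    sites: iterable of (methylated_count, unmethylated_count) pairs
           (hypothetical input layout, one pair per cytosine).
    """
    totals = defaultdict(int)
    binary = defaultdict(int)
    for meth, unmeth in sites:
        coverage = meth + unmeth
        if coverage == 0:
            continue  # uncovered cytosines carry no information
        totals[coverage] += 1
        if meth == 0 or unmeth == 0:
            binary[coverage] += 1
    return {cov: binary[cov] / totals[cov] for cov in sorted(totals)}


# Toy data: two sites at depth 1, three sites at depth 4.
example_sites = [(1, 0), (0, 1), (2, 2), (4, 0), (3, 1)]
result = binary_fraction_by_coverage(example_sites)
```

If nearly all binary sites sit at depth 1 or 2, coverage alone could explain the pattern. If sites with deep coverage are also almost all binary, the cause more likely lies elsewhere, such as in amplification or the mapping.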

    So, does anyone know how this bias can come about? My suspicion is that it is due either to an amplification bias when making the libraries or to something in the bioinformatics pipeline. Has anyone experienced similar results from bisulfite sequencing?

    Cheers
  • dpryan
    Devon Ryan
    • Jul 2011
    • 3478

    #2
    How were the samples mapped? I can't really debug issues with the library prep (I'm sure someone else here can), but at least the downstream stuff I can offer advice on.


    • nucacidhunter
      Jafar Jabbari
      • Jan 2013
      • 1250

      #3
      Originally posted by maf5:
      Hi everyone. I have sequencing data (Illumina HiSeq) from bisulfite-treated samples (Arabidopsis). The bisulfite conversion of some selected regions was tested by Sanger sequencing initially and looked fine. The library was then made from the same samples using the Diagenode MicroPlex kit.
      Cheers

      I assume you are referring to whole-genome bisulfite sequencing, and I wonder how you made a library from bisulfite-converted DNA. The kit input is dsDNA, while BS-converted DNA will be single-stranded.


      • maf5
        Junior Member
        • Jun 2016
        • 2

        #4
        Thanks for the replies, it's taken a little time for me to get the details from our bioinformatician.

        The library was paired-end sequenced. To increase the coverage, I mapped each of the mates as independent libraries. I used bowtie2 to map against the in-silico bisulfite-converted TAIR10 genome, with a seed length of 25 and at most one mismatch allowed in the seed. If a read mapped to multiple positions, I kept only the best alignment in terms of quality and number of mismatches.
        Using the mapping information, I then removed duplicates.
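For readers unfamiliar with the approach described above, here is a minimal sketch of what in-silico bisulfite conversion and the subsequent methylation call look like in principle. It assumes a gapless, forward-strand alignment and uses hypothetical function names; it is not the pipeline used in this thread, and real aligners also handle the reverse strand, indels, and read orientation.

```python
def in_silico_convert(seq, strand="+"):
    """Fully bisulfite-convert a sequence in silico.

    "+" converts every C to T (forward-strand reference);
    "-" converts every G to A (the same conversion seen from the
    reverse strand).
    """
    seq = seq.upper()
    if strand == "+":
        return seq.replace("C", "T")
    if strand == "-":
        return seq.replace("G", "A")
    raise ValueError("strand must be '+' or '-'")


def call_cytosines(ref, read):
    """Compare an unconverted reference with the original read.

    Assumes the read aligns to ref without gaps, on the forward
    strand, starting at position 0. Returns (position, call) pairs
    where "M" means the read kept a C (methylated) and "U" means the
    read shows a T (unmethylated, converted).
    """
    calls = []
    for pos, (ref_base, read_base) in enumerate(zip(ref.upper(), read.upper())):
        if ref_base != "C":
            continue
        if read_base == "C":
            calls.append((pos, "M"))
        elif read_base == "T":
            calls.append((pos, "U"))
        # any other base is a mismatch and gives no methylation call
    return calls


converted = in_silico_convert("ACGTC")       # forward-strand conversion
calls = call_cytosines("ACGTC", "ATGTC")     # first C converted, second kept
```

The reads are converted the same way before mapping, and the original read sequence is restored afterwards to make the call. Mistakes in strand handling at this step are one way a custom pipeline can produce distorted methylation levels.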

        Regarding the library preparation, the bisulfite treatment was performed on non-denatured DNA, so the majority of the genome would have remained double-stranded (we are not actually looking for methylation in this case). The library prep worked fine with the Diagenode kit, although it needed quite a few amplification rounds, which is why I suspect an amplification bias.


        • GenoMax
          Senior Member
          • Feb 2008
          • 7142

          #5
          Originally posted by maf5:
          The library was paired-end sequenced. To increase the coverage, I mapped each of the mates as independent libraries.

          How is that going to increase coverage?
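To illustrate the point behind this question: both mates come from the same DNA fragment, so where they overlap, counting them separately doubles the apparent depth without adding an independent observation. Below is a minimal sketch with made-up coordinates and hypothetical function names, using half-open (start, end) intervals.

```python
from collections import Counter


def naive_depth(mates):
    """Per-position depth when every mate is counted as its own read."""
    depth = Counter()
    for start, end in mates:
        for pos in range(start, end):
            depth[pos] += 1
    return depth


def fragment_depth(fragments):
    """Per-position depth when each fragment counts at most once,
    no matter how many of its mates cover the position."""
    depth = Counter()
    for mate1, mate2 in fragments:
        covered = set(range(*mate1)) | set(range(*mate2))
        for pos in covered:
            depth[pos] += 1
    return depth


# One fragment whose mates overlap on positions 60-99 (made-up numbers).
fragment = ((0, 100), (60, 160))
naive = naive_depth(list(fragment))
paired = fragment_depth([fragment])
```

In this toy case position 80 gets a depth of 2 under the naive count but 1 per fragment. Both observations come from one molecule, so they will always agree, which inflates confidence in a 0% or 100% call.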


          • dpryan
            Devon Ryan
            • Jul 2011
            • 3478

            #6
            The bioinformatician must be rather new to things. Please advise him/her to not try needlessly reinventing the wheel and instead use bwa-meth or bismark or one of the many preexisting alignment programs. It's incredibly likely that he/she simply screwed things up.

            Edit: Forgot an important "not" above!
            Last edited by dpryan; 08-15-2016, 11:55 AM.


            • ecSeq Bioinformatics
              Senior Member
              • May 2012
              • 490

              #7
              DNA Methylation Data Analysis Workshop
              How to use bisulfite-treated sequencing to study DNA methylation

              When?
              22-25 November 2016

              Where?
              Leipzig, Germany

              Link?


              ecSeq Bioinformatics is Europe’s leading provider of hands-on bioinformatics workshops and professional data analysis in the field of Next-Generation Sequencing (NGS).
