Seqanswers Leaderboard Ad

Collapse

Announcement

Collapse
No announcement yet.
X
 
  • Filter
  • Time
  • Show
Clear All
new posts

  • reduced representation bisulfite seq MspI library prep - non target reads?

    Hi,

    Just started this again in the Epigenetics section (originally posted in Bioinformatics here), because I realized it's probably a better fit here - sorry for double-posting - redirected the other one to here.

    I've been wondering about the proportion of 'non-target' reads in a RRBS library, i.e. reads that I don't think are associated with a MspI restriction site but still end up in my dataset. I would really appreciate if anyone could share their experience with respect to this - some details about my data below - thanks!

    The library was prepared with MspI digestion with the NEXTFLEX® Bisulfite Library Prep Kit for Illumina, and sequenced paired-end on Illumina Miseq.

    I was expecting that most of my reads had remnants of the MspI cutsite at the start, so I wrote a little script to check that. The expectation is that forward reads should start with 'CGG' or 'TGG', if the first 'C' was methylated or not, respectively. For the reverse reads I was expecting mainly 'CAA', but also a few 'CGA' (unmethylated cytosines were used for end-repair, and I am assuming that bisulfite conversion rate is not 100%), as well as a few 'CAG' or 'CGG' (theoretically possible).
    So, ideally I should have the following combinations (forward - reverse) - please correct me if I am wrong:
    CGG-CAA
    TGG-CAA
    CGG-CGA
    TGG-CGA
    CGG-CAG
    TGG-CAG
    CGG-CGG
    TGG-CGG

    My expectations were met in the way that CGG-CAA and TGG-CAA was by far the most common, ~20% of read pairs and ~6% of read pairs, respectively. The other combinations account for less than 1% in total. What puzzles me though is that I only get ~30% read pairs that have any of the above patterns.

    I figured that perhaps some of the fragments I get after the MspI digestion are somehow sheared during the library prep and loose the cutsite on one side, so I relaxed the search criteria to also count read pairs that have the expected pattern only in the forward read OR in the reverse read. An additional ~28% of the read pairs fell into this category.

    Which leaves me with >40% of read pairs that don't have any of the expected patterns, neither in the forward nor the reverse read.

    So, my question: Is this normal? Sure there will be some sequencing error, but the read quality is very good overall and the error rate should be very low at the start of the reads, so that can't affect 40% reads. Am I missing something here?

    Thanks in advance for your help!

    cheers,
    Christoph

Latest Articles

Collapse

  • seqadmin
    Best Practices for Single-Cell Sequencing Analysis
    by seqadmin



    While isolating and preparing single cells for sequencing was historically the bottleneck, recent technological advancements have shifted the challenge to data analysis. This highlights the rapidly evolving nature of single-cell sequencing. The inherent complexity of single-cell analysis has intensified with the surge in data volume and the incorporation of diverse and more complex datasets. This article explores the challenges in analysis, examines common pitfalls, offers...
    Today, 07:15 AM
  • seqadmin
    Latest Developments in Precision Medicine
    by seqadmin



    Technological advances have led to drastic improvements in the field of precision medicine, enabling more personalized approaches to treatment. This article explores four leading groups that are overcoming many of the challenges of genomic profiling and precision medicine through their innovative platforms and technologies.

    Somatic Genomics
    “We have such a tremendous amount of genetic diversity that exists within each of us, and not just between us as individuals,”...
    05-24-2024, 01:16 PM

ad_right_rmr

Collapse

News

Collapse

Topics Statistics Last Post
Started by seqadmin, Today, 08:18 AM
0 responses
11 views
0 likes
Last Post seqadmin  
Started by seqadmin, Today, 08:04 AM
0 responses
12 views
0 likes
Last Post seqadmin  
Started by seqadmin, 06-03-2024, 06:55 AM
0 responses
13 views
0 likes
Last Post seqadmin  
Started by seqadmin, 05-30-2024, 03:16 PM
0 responses
27 views
0 likes
Last Post seqadmin  
Working...
X