Unconfigured Ad

Collapse
X
 
  • Filter
  • Time
  • Show
Clear All
new posts
  • medalofhonour
    Member
    • Jul 2011
    • 18

    Tools to determine sequence contamination in Methyl-Seq data ?

    Hi,

    I have a question regarding determining sequence contamination from Methyl-Seq experiments.

    As we all know, for Methyl-Seq experiments, the bisulfite conversion step converts unmethylated C's to U's, which after sequencing become T's.

    I have Methyl-Seq data from a supposedly "human" sample sequenced on MiSeq, and the problem is that only 20% of the reads map to human reference hg19, using a methyl-seq specialized aligner BSMAP.

    I wish to find out where the 80% of the sequences are coming from, since they don't map to human.

    As we can intuitively see, I cannot just do something like take the overrepresented sequences from FASTQC, and do a quick BLAST to search for possible contamination from other organisms, since the overrepresented sequences could be bisulfite converted.

    Is there a tool out there that works like BLAST but takes into account bisulfite conversion while mapping sequences ? I know I could use BSMAP on the unmapped sequences (from human) and try and map them to other organisms, but that would take a longer time.

    Are there any other easy to use approaches I am missing out on ?
  • gandalf886
    Member
    • Feb 2013
    • 11

    #2
    I would just map the reads to the genome of potential contamination organism.
    I once got a RRBS library which is supposed to be human but the mapping efficiency is < 1%. I then tried to map to mouse and got 44% mapping efficiency. It turns out most of the DNA for library construction are contaminations from mouse feeder cells when we are making the human iPSC.

    Comment

    • medalofhonour
      Member
      • Jul 2011
      • 18

      #3
      Originally posted by gandalf886 View Post
      I would just map the reads to the genome of potential contamination organism.
      I once got a RRBS library which is supposed to be human but the mapping efficiency is < 1%. I then tried to map to mouse and got 44% mapping efficiency. It turns out most of the DNA for library construction are contaminations from mouse feeder cells when we are making the human iPSC.
      I tried mapping to mouse but no luck. My problem is that I don't really know potential contaminant organisms in this case.

      I will have to map to anything and everything to find out what could be the contaminant.

      Comment

      Latest Articles

      Collapse

      ad_right_rmr

      Collapse

      News

      Collapse

      Topics Statistics Last Post
      Started by SEQadmin2, 06-05-2026, 10:09 AM
      0 responses
      12 views
      0 reactions
      Last Post SEQadmin2  
      Started by SEQadmin2, 06-04-2026, 08:59 AM
      0 responses
      24 views
      0 reactions
      Last Post SEQadmin2  
      Started by SEQadmin2, 06-02-2026, 12:03 PM
      0 responses
      28 views
      0 reactions
      Last Post SEQadmin2  
      Started by SEQadmin2, 06-02-2026, 11:40 AM
      0 responses
      22 views
      0 reactions
      Last Post SEQadmin2  
      Working...