Seqanswers Leaderboard Ad

Collapse

Announcement

Collapse
No announcement yet.
X
 
  • Filter
  • Time
  • Show
Clear All
new posts

  • Mnase cause high multimapper fraction?

    Hello Community.

    I am working on histone modifications in zebrafish.
    I recently ran several samples for a histone methylation ChIP after MNase digestion.

    Now I run into the problem of having 60-80% of Multimappers in my mappable reads.
    Does anyone have the same problems or knows why that could happen?

    Some more specifications:
    I have "produced" this problem in 10 independent samples, plus in 4 Input samples (meaning that it is not the IP that enriches for Multimappers).

    Digestion by MNase was performed to have 80% Mono-and Dinucleosomes, but no over digestion (fragments smaller than 150bp)

    FastQC gives OK quality. There are some ambiguities with Kmer content, but other than that quality seems ok.

    I sequenced around 50Mio reads, from which 70-80% were mappable. However, from these, 60-80% were Multimappers. When analysing the Multimappers, they are distributed throughout the genome, and all "classes" of multiplicity is present.

    From that I reason that already the MNase digestion prefers repetitive sequences, but I have no idea why and how to prevent this.

    Does anyone has Input on this?

  • #2
    Did you try to report only uniquely mappable reads?

    Comment


    • #3
      Yes sure.
      IT makes 10-20% of the whole genome. However, in next experiments I do not want to waste 80% of my data on multimappers, if avoidable.

      As the problem appears also in the Input, I suspect the MNase digestion or clean-up to cause the problem. This is why I ask for help

      Comment


      • #4
        What is your definition of multimappers?

        Comment


        • #5
          That one single read maps to more than one position in the genome with the same confidence.

          Is there any other definition?

          Comment


          • #6
            I analysed some public MNase data recently for a large complex genome >2gb and had this issue too with 50bp reads. There appears to be a very high background in MNase data.

            Longer read lengths might partially mitigate the issue. However, if MNase indicates you are primarily sequencing complex repeats this problem will always occur if read lengths are shorter than the compound repeats. Interestingly, those using the data are very happy with it, especially the patterns around gene boundaries appear as expected.

            To practically deal with it I have produced two tracks, one with all reads and one filtered by samtools to only include Mapping Quality 20+ reads.

            Comment


            • #7
              Thank you all for your replies!

              Would you think that using paired-end seq might help?
              First of all it is likely that one of the two reads is not multi-mapper, and thus would give more confidence for the second read.
              Second, since one knows the possible distance between the reads, one can further increase mapping confidence.

              IS that an option?

              Comment

              Latest Articles

              Collapse

              • seqadmin
                Recent Advances in Sequencing Analysis Tools
                by seqadmin


                The sequencing world is rapidly changing due to declining costs, enhanced accuracies, and the advent of newer, cutting-edge instruments. Equally important to these developments are improvements in sequencing analysis, a process that converts vast amounts of raw data into a comprehensible and meaningful form. This complex task requires expertise and the right analysis tools. In this article, we highlight the progress and innovation in sequencing analysis by reviewing several of the...
                Today, 07:48 AM
              • seqadmin
                Essential Discoveries and Tools in Epitranscriptomics
                by seqadmin




                The field of epigenetics has traditionally concentrated more on DNA and how changes like methylation and phosphorylation of histones impact gene expression and regulation. However, our increased understanding of RNA modifications and their importance in cellular processes has led to a rise in epitranscriptomics research. “Epitranscriptomics brings together the concepts of epigenetics and gene expression,” explained Adrien Leger, PhD, Principal Research Scientist...
                04-22-2024, 07:01 AM

              ad_right_rmr

              Collapse

              News

              Collapse

              Topics Statistics Last Post
              Started by seqadmin, Today, 07:17 AM
              0 responses
              11 views
              0 likes
              Last Post seqadmin  
              Started by seqadmin, 05-02-2024, 08:06 AM
              0 responses
              19 views
              0 likes
              Last Post seqadmin  
              Started by seqadmin, 04-30-2024, 12:17 PM
              0 responses
              20 views
              0 likes
              Last Post seqadmin  
              Started by seqadmin, 04-29-2024, 10:49 AM
              0 responses
              28 views
              0 likes
              Last Post seqadmin  
              Working...
              X