  • Finding significantly different ChIP-seq peak length?

    Hello,

    I have a simple data analysis question.

    Say I have ChIP-seq data for a protein, comparing three wild-type replicates with three mutant replicates. I have mapped the reads and normalized the data using spike-ins; for the sake of argument, let's assume the data is properly normalized.

    A typical pipeline would then call peaks, count the reads in those peaks, and compare WT vs. mutant with a statistical test (e.g. using MAnorm, DESeq2, or other software built to find differentially "expressed" ChIP-seq peaks).

    In the ideal scenario, the WT and MUT peaks overlap nicely at a hotspot, so we can use the method above to find significantly up/down ChIP-seq peaks based on signal.



    However, what if it's not the signal that we want to compare, but the peak length? Say that, upon mutation, the protein spreads out around the hotspots instead of staying within them. The overall signal then does not change, but the shape or length does. Worse, the change in length is not consistent: a peak can extend to the left, to the right, or in both directions.



    The wild-type peaks are very consistent in shape and length; only the mutant peaks change.

    The goal is to test whether there is a significant increase or decrease in peak length compared to wild type. Whether the peak extends to the right, to the left, or both doesn't matter. Which statistical test would be best?

    What I did was first call peaks, then merge peaks that are close together across the 6 samples (I call these merged regions hotspots). Then, for each sample, I summed the lengths of its peaks within each hotspot. When I looked at the distribution, the lengths for each sample appear to follow a Poisson/negative binomial distribution, so I used DESeq2 to find significantly different lengths. Would this be an acceptable method?
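
    The bookkeeping described above (merge nearby peaks into hotspots, then sum each sample's peak length per hotspot) could be sketched roughly as follows in Python. This is only an illustration under stated assumptions: peaks are BED-style half-open (chrom, start, end) tuples, peaks do not overlap within a single sample, and the function names and the max_gap parameter are hypothetical rather than taken from any peak caller or package.

```python
def merge_into_hotspots(peaks_by_sample, max_gap=0):
    """Pool peaks from all samples and merge those within max_gap bp into hotspots."""
    all_peaks = sorted(
        (chrom, start, end)
        for peaks in peaks_by_sample.values()
        for chrom, start, end in peaks
    )
    hotspots = []
    for chrom, start, end in all_peaks:
        last = hotspots[-1] if hotspots else None
        if last and last[0] == chrom and start <= last[2] + max_gap:
            # Close enough to the previous hotspot: extend it.
            hotspots[-1] = (chrom, last[1], max(last[2], end))
        else:
            hotspots.append((chrom, start, end))
    return hotspots


def length_matrix(peaks_by_sample, hotspots):
    """Return {hotspot: {sample: summed peak length in bp}}."""
    matrix = {h: {s: 0 for s in peaks_by_sample} for h in hotspots}
    for sample, peaks in peaks_by_sample.items():
        for chrom, start, end in peaks:
            for h in hotspots:
                # Every input peak lies inside exactly one merged hotspot.
                if h[0] == chrom and h[1] <= start and end <= h[2]:
                    matrix[h][sample] += end - start
                    break
    return matrix


# Made-up toy coordinates, for illustration only.
example_peaks = {
    "WT_1": [("chr1", 100, 200)],
    "WT_2": [("chr1", 110, 210)],
    "MUT_1": [("chr1", 50, 150), ("chr1", 220, 300)],
}
example_hotspots = merge_into_hotspots(example_peaks, max_gap=20)
example_lengths = length_matrix(example_peaks, example_hotspots)
```

    The resulting hotspot-by-sample table of lengths is the kind of matrix that would then be handed to a count-based test; whether base-pair lengths really satisfy that test's assumptions is the open question of this post.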

  • #2
    Hi, it's a good question! When you use methods like DESeq, you compress all the information in a peak into a single number: the read count (or the length, in your case). Have a look at this paper and its associated R package: "MMDiff: quantitative testing for shape changes in ChIP-Seq data sets".

    Dario

    • #3
      For something like this you might want to test for differences in the distribution, such as with a KS test.
      edit: MMDiff looks MUCH more interesting!
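
      For the KS idea, a rough sketch of the two-sample Kolmogorov-Smirnov statistic in plain Python, comparing pooled peak lengths from the two conditions. The function name and the example lengths are made up for illustration. It returns only the statistic D (the largest gap between the two empirical CDFs), not a p-value; in practice a library routine such as scipy.stats.ks_2samp would give both. Note also that pooling lengths like this ignores the replicate structure.

```python
from bisect import bisect_right


def ks_statistic(sample_a, sample_b):
    """Two-sample KS statistic: max absolute difference between empirical CDFs."""
    a = sorted(sample_a)
    b = sorted(sample_b)
    d = 0.0
    # The empirical CDFs only change at observed values, so checking those suffices.
    for value in sorted(set(a) | set(b)):
        cdf_a = bisect_right(a, value) / len(a)
        cdf_b = bisect_right(b, value) / len(b)
        d = max(d, abs(cdf_a - cdf_b))
    return d


# Hypothetical peak lengths in bp (not real data).
wt_lengths = [100, 105, 110, 98, 102]
mut_lengths = [150, 180, 160, 101, 175]
d_stat = ks_statistic(wt_lengths, mut_lengths)
```

      A D near 0 means the two length distributions look alike; a D near 1 means they barely overlap.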

      • #4
        Originally posted by dpryan
        For something like this you might want to test for differences in the distribution, such as with a KS test.
        edit: MMDiff looks MUCH more interesting!

        Trying MMDiff out on both ChIP-seq and BS-seq data is on my to-do list, but I still haven't gotten around to it!

        • #5
          Thanks for the MMDiff suggestion! Will definitely try it out!!
