Unconfigured Ad

Collapse
X
 
  • Filter
  • Time
  • Show
Clear All
new posts
  • lwhitmore
    Member
    • Aug 2013
    • 70

    MACS Chip Seq Biological Replicates

    Hey All,
    I have 2 biological replicates for a treatment chip seq experiment. When I analyze these 2 replicates independently (using MACS 1.4.2) I get about 3000 peaks identified for both replicates however when I merge them together I only get about 2000. How exactly does merging the replicates together change how MACS identifies peaks?
    Any help would be appreciated
    Leanne
  • jennyal
    Junior Member
    • Dec 2014
    • 3

    #2
    The same 'problem'

    Hi!
    I wonder if you got an answer to your problem with 'loosing' peaks when merging files?
    We are experiencing the same thing.

    Thanks!
    Jenny

    Comment

    • lwhitmore
      Member
      • Aug 2013
      • 70

      #3
      Hi,
      Unfortunately I did not ever get an answer to that question.

      Sorry, If you end up finding out why you should let me know

      Leanne

      Comment

      • rory
        Member
        • Aug 2008
        • 28

        #4
        I'll ChIP in on this one...

        In general, it is not a good idea to combine the reads from biological replicates. One reason is that you lose the ability to distinguish the biological and experimental variability inherent in ChIP experiments. This is similar to the reason you wouldn't combine all the reads from biological replicates in an RNA-seq experiment. Remember we expect ChIP experiments to exhibit even greater variability than RNA!

        The other reason is that it tends to break peak calling, as you discovered. Peak calling involves separating signal (peaks) from noise (background). In a ChIP, the majority of reads are background. So when you combine replicate data, the background levels all get higher and/or more filled in, which makes it in general more difficult to distinguish real peaks. If the replicates lined up cleanly -- for example technical replicates -- the peaks would fill in as well, but the biological and experimental variance means that the peaks just tend to either be a) relatively "lower"/less dense or b) less well-formed (messier "peaks"). The good aspect of this is that any peaks that are able to be identified (your reduced set of 2000) should have a lower false discovery rate.

        For a two-replicate experiment, you can follow the ENCODE guidelines to call peaks separately and combine them into a low-FDR set using the IDR calculation. I'd expect this to give even less than 2000 peaks as it can be quite a stringent test. It may also help to look at the overlap of the two peaksets to see how similar the 3,000 peaks from each replicate are.

        Cheers-
        Rory

        Comment

        • jennyal
          Junior Member
          • Dec 2014
          • 3

          #5
          Ok, at least I know now. Of course I'll let you know if I come across the answer!

          BW

          Jenny

          Comment

          • jennyal
            Junior Member
            • Dec 2014
            • 3

            #6
            Oh, Rory, I only saw you reply after I posted mine.

            Thanks for the info :-)

            BW

            Jenny

            Comment

            • Irem
              Junior Member
              • Sep 2014
              • 1

              #7
              Hi Rory,

              Could you please tell me how to combine the peaks from replicates into a low-FDR set using the IDR calculation?

              Thanks,

              Irem

              Comment

              • rory
                Member
                • Aug 2008
                • 28

                #8
                This page, "ENCODE: TF ChIP-seq peak calling using the Irreproducibility Discovery Rate (IDR) framework" , contains the instructions referenced in the "Software Tools Used to Create the ENCODE Resource" page on the UCSC browser...

                -R

                Comment

                • lwhitmore
                  Member
                  • Aug 2013
                  • 70

                  #9
                  Rory,
                  I looked at the page you mentioned and that program specifically says if you used macs14 that you should not use it to calculate the IDR. Do you know of any programs that can accurately calculate the IDR for peaks found by mac14?

                  Thanks
                  Leanne

                  Comment

                  Latest Articles

                  Collapse

                  • SEQadmin2
                    Nine Things a Sample Prep Scientist Thinks About Before Sequencing
                    by SEQadmin2


                    I’m not a sequencing expert. I’m a purification scientist who uses NGS to evaluate workflows my group develops. With this perspective, we think about the sample first and the NGS workflow second. The sequencer is an exceptionally honest reporter, but it can only report on what you give it, so whether you get clean, interpretable data from an NGS workflow is largely determined before you begin.


                    Here are nine questions we think about, in roughly the order they matter, before...
                    Yesterday, 07:11 AM
                  • SEQadmin2
                    From Collection to Sequencing: Why Sample Preparation and Preservation Define Sequencing Data
                    by SEQadmin2


                    Data variability is still an issue in sequencing technologies despite the advances in reproducibility and accuracy of these platforms. But the problem does not originate in the sequencing itself, but in the previous steps, before the sample reaches the sequencer.


                    The first step is collection, followed by preservation and sample preparation for analysis. Most scientists overlook those steps, but not being careful might just be skewing the experiment’s results.
                    ...
                    06-02-2026, 10:05 AM
                  • SEQadmin2
                    Single-Cell Sequencing at an Inflection Point: Early Impacts of New Platforms and Emerging Trends
                    by SEQadmin2


                    With the launch of new single-cell sequencing platforms in 2026, the field stands at an exciting inflection point. This article surveys the most impactful advances in the field and discusses how they’re reshaping research in cancer, immunology, and beyond.


                    Introduction

                    Single-cell sequencing technologies have undergone remarkable advances over the past decade, transitioning from low-throughput experimental approaches to highly scalable platforms capable of...
                    05-22-2026, 06:42 AM

                  ad_right_rmr

                  Collapse

                  News

                  Collapse

                  Topics Statistics Last Post
                  Started by SEQadmin2, 06-17-2026, 06:09 AM
                  0 responses
                  20 views
                  0 reactions
                  Last Post SEQadmin2  
                  Started by SEQadmin2, 06-09-2026, 11:58 AM
                  0 responses
                  38 views
                  0 reactions
                  Last Post SEQadmin2  
                  Started by SEQadmin2, 06-05-2026, 10:09 AM
                  0 responses
                  44 views
                  0 reactions
                  Last Post SEQadmin2  
                  Started by SEQadmin2, 06-04-2026, 08:59 AM
                  0 responses
                  49 views
                  0 reactions
                  Last Post SEQadmin2  
                  Working...