Unconfigured Ad

Collapse
X
 
  • Filter
  • Time
  • Show
Clear All
new posts
  • aquleaf
    Member
    • Mar 2010
    • 38

    ChIP-Seq Integration & Interpretation

    Hi. I have three ChIP-Seq data for the same sample. Peak calling was done separately. How to combine the peaks of these three experiments together?

    One more question, How to associate genes with these binding sites?

    Thanks in advance!
  • mudshark
    Senior Member
    • Jan 2009
    • 138

    #2
    these are technical replicates, right? So you could simply do the peak calling on the combined data. If you want to do the peak calling first the easiest - maybe not the best - way is to define an overlapping set of peaks.

    About the association, there are some tools around that can help you (i guess Galaxy). On the other hand that is not a too complicated job, isn't it? Calculating the distances to some gene features and stuff.

    Comment

    • ETHANol
      Senior Member
      • Feb 2010
      • 308

      #3
      I think this is what you are looking for:


      There are some other ideas on my blog at:
      Everything Ethan knows about biology - The Ethan-ome


      On the first question, I agree combining the data is a good way to go. You can also get the common peaks between the samples by using the BedTools intersectBed program. It is also available on Galaxy if you are not comfortable with the command line.
      --------------
      Ethan

      Comment

      • rory
        Member
        • Aug 2008
        • 28

        #4
        Are the replicates purely technical (mutliple lanes of the same library) or was the experiment repeated three times (three different libraries)? This makes a big difference. If it is just three lanes of the same library, then I agree the data should be merged and peaks called once. However without replicating the experiment, you will never be able to get reliable results -- you have no clue as to the variation inherent in the experiment, and whether your library represents an outlier.

        If the experiment was repeated three times, then you should not combine the reads, but rather work with the three peaksets. Start by checking their overlaps (3-way venn diagram). To minimize false positives, you could take only the peaks that all three replicates agree on, or be less stringent and take those that overlap in at least two of the three libraries. Another approach is to take all the peaks, then go back to the aligned reads and re-score each peak for each replicate (whether or not it was identified as a peak in that replicate).

        See also this paper for ideas on how to combine peaksets: http://projecteuclid.org/DPubS?servi...oas/1318514284

        If you are using R and Bioconductor, there are a number of packages that can help with this type of analysis. For example DiffBind helps with overlap and re-scoring, and ChIPpeakAnno will map peaks to genes.
        Last edited by rory; 12-02-2011, 07:51 AM. Reason: typo

        Comment

        Latest Articles

        Collapse

        • SEQadmin2
          Nine Things a Sample Prep Scientist Thinks About Before Sequencing
          by SEQadmin2


          I’m not a sequencing expert. I’m a purification scientist who uses NGS to evaluate workflows my group develops. With this perspective, we think about the sample first and the NGS workflow second. The sequencer is an exceptionally honest reporter, but it can only report on what you give it, so whether you get clean, interpretable data from an NGS workflow is largely determined before you begin.

          Here are nine questions we think about, in roughly the order they matter, before...
          06-18-2026, 07:11 AM
        • SEQadmin2
          From Collection to Sequencing: Why Sample Preparation and Preservation Define Sequencing Data
          by SEQadmin2


          Data variability is still an issue in sequencing technologies despite the advances in reproducibility and accuracy of these platforms. But the problem does not originate in the sequencing itself, but in the previous steps, before the sample reaches the sequencer.


          The first step is collection, followed by preservation and sample preparation for analysis. Most scientists overlook those steps, but not being careful might just be skewing the experiment’s results.
          ...
          06-02-2026, 10:05 AM

        ad_right_rmr

        Collapse

        News

        Collapse

        Topics Statistics Last Post
        Started by SEQadmin2, 06-26-2026, 11:10 AM
        0 responses
        12 views
        0 reactions
        Last Post SEQadmin2  
        Started by SEQadmin2, 06-17-2026, 06:09 AM
        0 responses
        48 views
        0 reactions
        Last Post SEQadmin2  
        Started by SEQadmin2, 06-09-2026, 11:58 AM
        0 responses
        106 views
        0 reactions
        Last Post SEQadmin2  
        Started by SEQadmin2, 06-05-2026, 10:09 AM
        0 responses
        125 views
        0 reactions
        Last Post SEQadmin2  
        Working...