Seqanswers Leaderboard Ad

Collapse

Announcement

Collapse
No announcement yet.
X
 
  • Filter
  • Time
  • Show
Clear All
new posts

  • diffReps - ChIP-seq differential analysis package

    Dear colleagues,

    I'd like to introduce you to diffReps -- a ChIP-seq differential analysis package that my group developed.

    A platoon of peak calling programs have been developed in the past few years to identify TF binding sites and chromatin modification sites under a single condition. However, the methods that are available to identify the differences between two conditions are surprisingly scarce. This is an important question because ChIP-seq is now routinely applied to in vivo tissues where two or more states need to be compared: be it disease vs. normal or laboratory treatment vs. control.

    This kind of problems has its unique challenges. Changes are often subtle and do not stand out as clear as peaks vs. background. Biological replicates are commonly used to increase the statistical power. It is critical to select the appropriate statistical test for this task. It is also crucial to handle the large ChIP-seq files as painless as possible.

    In the past 3-4 years, my group have handled hundres of ChIP-seq samples, the majority of which come from in vivo mouse brain. To handle this large amount of data, we decided to develop our own program because none of the method on the market appears to be satisfactory to our needs. The result is diffReps -- a program package that integrates data manipulation, statistical tests, downstream annotations, etc., all into one command line. It literally saved our lives and tons of time.

    diffReps is developed in PERL and it runs on all platforms such as Linux, Mac and Windows. It uses only modest RAM and finishes running in reasonable time, even for large files. It also integrates two very useful tools. One is called region analysis, a program that can annotate differential sites or peaks into genes or heterochromatic regions. The other is called hotspot finding, a program that can identify locations where the differential chromatin modifications happen more often than random.

    diffReps is shared as an open source project and can be found here:

    https://code.google.com/p/diffreps/.

    I'm also happy to announce that the manuscript about diffReps has been published by PLOS ONE and can be located here:

    ChIP-seq is increasingly being used for genome-wide profiling of histone modification marks. It is of particular importance to compare ChIP-seq data of two different conditions, such as disease vs. control, and identify regions that show differences in ChIP enrichment. We have developed a powerful and easy to use program, called diffReps, to detect those differential sites from ChIP-seq data, with or without biological replicates. In addition, we have developed two useful tools for ChIP-seq analysis in the diffReps package: one for the annotation of the differential sites and the other for finding chromatin modification “hotspots”. diffReps is developed in PERL programming language and runs on all platforms as a command line script. We tested diffReps on two different datasets. One is the comparison of H3K4me3 between two human cell lines from the ENCODE project. The other is the comparison of H3K9me3 in a discrete region of mouse brain between cocaine- and saline-treated conditions. The results indicated that diffReps is a highly sensitive program in detecting differential sites from ChIP-seq data.


    If you are working on ChIP-seq data and would like to compare two conditions, I suggest you to give diffReps a try. There is also a discussion group about diffReps, where you can post your usage questions or receive announcements:



    As we are still working to improve diffReps further, please sign up to become a member of the discussion group so that any future releases and news will be directed to your inbox.

    I hope you find diffReps to be useful to your research.

  • #2
    Your PLOS ONE paper only looks at histone modifications. Would diffReps be appropriate to use with transcription factors that don't generate wide peaks like histone modifications? Perhaps using a smaller window size and step size would be better in this situation?

    Would you recommend removing duplicate reads (ie. PCR duplicates) before running diffReps?

    Is the strand (+/-) required in the .bed file used for input, or just the first 3 columns (chromosome, chromStart, chromEnd)? I have paired end reads so I was thinking I could combine them, since it doesn't look like diffReps can take paired end reads. If I do this then strand is irrelevant, so I could leave it blank or just say "+" for everything.
    Last edited by biznatch; 08-06-2013, 01:35 PM.

    Comment


    • #3
      biznatch,

      Yes, you can certainly use diffReps on transcription factor ChIP-seq. I don't see any reason why it cannot be used on TF. Using smaller window size is recommended for TF data.

      And I would recommend removing PCR duplicates before diffReps, which provides a more conservative list.

      If you have paired-end data, you can manipulate them as you have already done and set the fragment size to 0. The strand info is used to shift short reads. Once you set the fragment size to 0, they become irrelevant.

      Comment

      Latest Articles

      Collapse

      • seqadmin
        Current Approaches to Protein Sequencing
        by seqadmin


        Proteins are often described as the workhorses of the cell, and identifying their sequences is key to understanding their role in biological processes and disease. Currently, the most common technique used to determine protein sequences is mass spectrometry. While still a valuable tool, mass spectrometry faces several limitations and requires a highly experienced scientist familiar with the equipment to operate it. Additionally, other proteomic methods, like affinity assays, are constrained...
        04-04-2024, 04:25 PM
      • seqadmin
        Strategies for Sequencing Challenging Samples
        by seqadmin


        Despite advancements in sequencing platforms and related sample preparation technologies, certain sample types continue to present significant challenges that can compromise sequencing results. Pedro Echave, Senior Manager of the Global Business Segment at Revvity, explained that the success of a sequencing experiment ultimately depends on the amount and integrity of the nucleic acid template (RNA or DNA) obtained from a sample. “The better the quality of the nucleic acid isolated...
        03-22-2024, 06:39 AM

      ad_right_rmr

      Collapse

      News

      Collapse

      Topics Statistics Last Post
      Started by seqadmin, 04-11-2024, 12:08 PM
      0 responses
      25 views
      0 likes
      Last Post seqadmin  
      Started by seqadmin, 04-10-2024, 10:19 PM
      0 responses
      28 views
      0 likes
      Last Post seqadmin  
      Started by seqadmin, 04-10-2024, 09:21 AM
      0 responses
      24 views
      0 likes
      Last Post seqadmin  
      Started by seqadmin, 04-04-2024, 09:00 AM
      0 responses
      52 views
      0 likes
      Last Post seqadmin  
      Working...
      X