Unconfigured Ad

Collapse
X
 
  • Filter
  • Time
  • Show
Clear All
new posts
  • gigigou
    Member
    • May 2012
    • 31

    The distribution of peak in ChIP-seq on function element

    I have got some peaks from a ChIP-seq data.
    Now I'm trying to locate these peaks on the function elements.
    But I come up with a question while doing this.
    That is a peak may be extreme long that it covers many function elements.
    for example, let ~ represents peak, === represents exon, ------ represents intron

    ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
    =======--------------================-----------=========

    In this case, which element should I chose?
    Thank you!
  • mudshark
    Senior Member
    • Jan 2009
    • 138

    #2
    if it covers introns and exons it basically covers genes, your feature definition might be too fine grained.

    Comment

    • dariober
      Senior Member
      • May 2010
      • 311

      #3
      Originally posted by gigigou View Post
      In this case, which element should I chose?
      I don't think there's a clear cut answer to this question. It probably depends on the particular mark you are looking at (histones, transcription factors, methylations, etc...) and on the particular gene or genomic region as well as the technical resolution you can achieve (your peaks can be very wide because of the limitations of the technique and/or beacuse the mark is really that large.)

      Searching for a one-to-one relation between peak and genomic feature will make downstream analyses and interpretations easier. But does it make sense biologically? I guess it is perfectly possible for a peak (histone, transcription factor...) to span and control several elements (genes, promoters etc...).

      If you could give more detail about your experiment I think you could get better answers, especially from biologists.

      Good luck!
      Dario

      Comment

      • mudshark
        Senior Member
        • Jan 2009
        • 138

        #4
        if there is a clear peak summit try to use this position for getting a one-to-one relationship

        Comment

        • gigigou
          Member
          • May 2012
          • 31

          #5
          Originally posted by dariober View Post
          I don't think there's a clear cut answer to this question. It probably depends on the particular mark you are looking at (histones, transcription factors, methylations, etc...) and on the particular gene or genomic region as well as the technical resolution you can achieve (your peaks can be very wide because of the limitations of the technique and/or beacuse the mark is really that large.)

          Searching for a one-to-one relation between peak and genomic feature will make downstream analyses and interpretations easier. But does it make sense biologically? I guess it is perfectly possible for a peak (histone, transcription factor...) to span and control several elements (genes, promoters etc...).

          If you could give more detail about your experiment I think you could get better answers, especially from biologists.

          Good luck!
          Dario
          Thank you very much.
          I'm trying to build an analyze pipeline for ChIP-seq data. It doesn't focus on a specific experiment. So I have to figure out a solution which can get the one-to-one relationship.
          Maybe mudshark's suggestion can be a instruction.

          Comment

          • gigigou
            Member
            • May 2012
            • 31

            #6
            Originally posted by mudshark View Post
            if there is a clear peak summit try to use this position for getting a one-to-one relationship
            Thank you!
            I'll try your suggestion.
            But I have a question, this will lose much infomation, doesn't it?
            The summit should be highly credible as it represents the whole peak.

            Comment

            • DZhang
              Senior Member
              • Jun 2010
              • 177

              #7
              Please be aware of the dangers using a fixed pipelines for different experiments. As it has been said by the previous posts, different biological questions may require different choice of features. I'd build several options in the pipeline to handle a few common scenarios. Most important of all, talk to whomever is going to use your pipelines first before making a decision.

              Comment

              • gigigou
                Member
                • May 2012
                • 31

                #8
                Originally posted by DZhang View Post
                Please be aware of the dangers using a fixed pipelines for different experiments. As it has been said by the previous posts, different biological questions may require different choice of features. I'd build several options in the pipeline to handle a few common scenarios. Most important of all, talk to whomever is going to use your pipelines first before making a decision.
                Your words are absolutely right. I'm going to set some options.
                BTW, is there any tool finding genes according to peak coordinates?
                I'm currently using PeakAnalyzer, but some of the genes found by it cann't be recognized by GoTermFinder.
                Thank you!

                Comment

                • dariober
                  Senior Member
                  • May 2010
                  • 311

                  #9
                  Originally posted by mudshark View Post
                  if there is a clear peak summit try to use this position for getting a one-to-one relationship
                  In addition you (gigigou) could set a "priority cue" of genomic features to further restrict the annotation. This list could (and should) be defined by the end user of the pipeline. For example, a cue roughly suitable for a transcription factor could be:

                  TSS > 5'UTR > exon > 3'UTR > intron > intergenic

                  So if a peak overlaps equally well a transcription start site (TSS, possibly extended upstream by n bases) and an intron, choose the TSS as annotation.

                  Dario

                  Comment

                  • DZhang
                    Senior Member
                    • Jun 2010
                    • 177

                    #10
                    gigigou,

                    As to PeakAnalyzer vs GoTermFinder, I am not sure if the problem resides with your annotation, PeaAnalyzer or GOTermFinder. You need to examine the genes that cannot be recognized by GOTermFinder more carefully. As always, please make sure your gene annotation and GO terms match. (An old gene annotation may not work well with GO terms with the latest gene annotation.)

                    Regards,
                    Douglas

                    Comment

                    • gigigou
                      Member
                      • May 2012
                      • 31

                      #11
                      Originally posted by dariober View Post
                      In addition you (gigigou) could set a "priority cue" of genomic features to further restrict the annotation. This list could (and should) be defined by the end user of the pipeline. For example, a cue roughly suitable for a transcription factor could be:

                      TSS > 5'UTR > exon > 3'UTR > intron > intergenic

                      So if a peak overlaps equally well a transcription start site (TSS, possibly extended upstream by n bases) and an intron, choose the TSS as annotation.

                      Dario
                      This is an excellent idea.
                      I think I need to figure out the "priority cue " first as I'm a green fingure in ChIP-seq.
                      Thank you.

                      Comment

                      • gigigou
                        Member
                        • May 2012
                        • 31

                        #12
                        Originally posted by DZhang View Post
                        gigigou,

                        As to PeakAnalyzer vs GoTermFinder, I am not sure if the problem resides with your annotation, PeaAnalyzer or GOTermFinder. You need to examine the genes that cannot be recognized by GOTermFinder more carefully. As always, please make sure your gene annotation and GO terms match. (An old gene annotation may not work well with GO terms with the latest gene annotation.)

                        Regards,
                        Douglas
                        www.contigexpress.com
                        I have checked the unrecognized genes on GO website. They are indeed exist. And the gene annotation database used by GOTermFinder is updated as well. I'll look into this problem in other aspects.
                        Thank you very much!

                        Comment

                        • DZhang
                          Senior Member
                          • Jun 2010
                          • 177

                          #13
                          Please share your findings in the forum.

                          Regards,
                          Douglas

                          Comment

                          Latest Articles

                          Collapse

                          • SEQadmin2
                            From Collection to Sequencing: Why Sample Preparation and Preservation Define Sequencing Data
                            by SEQadmin2


                            Data variability is still an issue in sequencing technologies despite the advances in reproducibility and accuracy of these platforms. But the problem does not originate in the sequencing itself, but in the previous steps, before the sample reaches the sequencer.


                            The first step is collection, followed by preservation and sample preparation for analysis. Most scientists overlook those steps, but not being careful might just be skewing the experiment’s results.
                            ...
                            06-02-2026, 10:05 AM
                          • SEQadmin2
                            Single-Cell Sequencing at an Inflection Point: Early Impacts of New Platforms and Emerging Trends
                            by SEQadmin2


                            With the launch of new single-cell sequencing platforms in 2026, the field stands at an exciting inflection point. This article surveys the most impactful advances in the field and discusses how they’re reshaping research in cancer, immunology, and beyond.


                            Introduction

                            Single-cell sequencing technologies have undergone remarkable advances over the past decade, transitioning from low-throughput experimental approaches to highly scalable platforms capable of...
                            05-22-2026, 06:42 AM
                          • SEQadmin2
                            Environmental Genomics in the Age of NGS: From Microbes to Conservation Strategies
                            by SEQadmin2

                            Studying ecosystems means dealing with complex, multi-species communities that are hard to observe at scale. This complexity, however, hides many important questions to be answered, from how biogeochemical cycles work and how climate change can affect species distribution to how conservation strategies can work best.


                            Genomics, particularly since the expansion of NGS, has transformed ecosystem ecology. By sequencing environmental DNA, we can now assess biodiversity without direct...
                            05-06-2026, 09:04 AM

                          ad_right_rmr

                          Collapse

                          News

                          Collapse

                          Topics Statistics Last Post
                          Started by SEQadmin2, Today, 08:59 AM
                          0 responses
                          8 views
                          0 reactions
                          Last Post SEQadmin2  
                          Started by SEQadmin2, 06-02-2026, 12:03 PM
                          0 responses
                          21 views
                          0 reactions
                          Last Post SEQadmin2  
                          Started by SEQadmin2, 06-02-2026, 11:40 AM
                          0 responses
                          15 views
                          0 reactions
                          Last Post SEQadmin2  
                          Started by SEQadmin2, 05-28-2026, 11:40 AM
                          0 responses
                          29 views
                          0 reactions
                          Last Post SEQadmin2  
                          Working...