Seqanswers Leaderboard Ad

Collapse

Announcement

Collapse
No announcement yet.
X
 
  • Filter
  • Time
  • Show
Clear All
new posts

  • GO enrichment between two samples

    Dear all,

    I want to study the GO enrichment between two samples (they differ in age) of a particular species.
    I have two sets of data:
    1) list of differentially expressed genes in sample 1 along with the GOs associated with each gene
    1) list of differentially expressed genes in sample 2 along with the GOs associated with each gene

    How can I study which enriched GOs between the two sets? do I need to have background genes (i.e. non differentially expressed genes)?
    is there a tool that do that? (FYI: this species is not supported by most of GO analysis tools like Gorilla).

  • #2
    Since you don't mention the species, it is hard to know what available or pre-built tools, like DAVID would be useful.

    There are tools in BioConductor to compute a hypergeometric enrichment test for any set of gene lists (ie. library GOstats). So, if you have lists of genes of interest, what you then need is a list of the gene "universe" you wish to compare them to in your test for significant enrichment and a list of GO terms associated with them (an ontology in other words).

    Do you have, or can you put together a gene universe and a (custom or otherwise) ontology for this species? If so, there is a slide deck under the courses at the BioConductor web site that lays out how to perform a hypergeometric enrichment analysis of those three things, namely:

    1. A species-specific Gene "universe"
    2. Your ontology, or GO categories (categorize genes by GO terms).
    3. A list of interesting genes (e.g. differentially expressed genes based on your significance testing)
    Michael Black, Ph.D.
    ScitoVation LLC. RTP, N.C.

    Comment


    • #3
      Thanks Michael for your quick reply!.

      My species is a type of honey bee. So, I am wondering if DAVID would be useful in this case? shall I paste the GOs of one sample as target and the other as background? or I put the uniprot IDs instead of GOs?

      regarding your suggestion of Bioconductor GOstats tool. Yes. I can get the set gene universe (i.e. background set), but, let me ask what do we mean by the gene universe? is it the set of all genes in the annotation for this species? the set of all expressed genes in this species? or simply it is the background for both sets that I have (i.e. the non differentially expressed genes for each of the two samples?)?

      I tried to find this slide deck but still unable to find it. Can you put the link for it?

      Thanks very much

      Comment


      • #4
        You can certainly try running your list of differentially expressed gene IDs through DAVID and see what you get.

        For R, the universe of genes would be all known/annotated coding genes for your species. And you would want the known GO terms associated with them. It could end up being a bit of a chore to put a decent custom ontology together to perform enrichment with, unless the information is somewhat readily available for honeybee somewhere.

        You can also use R tools to just take your list of differentially expressed gene IDs and perform enrichment against all of the public GO. The entire public GO database is available in R already.
        Michael Black, Ph.D.
        ScitoVation LLC. RTP, N.C.

        Comment


        • #5
          Give Blast2GO a try.

          Comment

          Latest Articles

          Collapse

          • seqadmin
            Strategies for Sequencing Challenging Samples
            by seqadmin


            Despite advancements in sequencing platforms and related sample preparation technologies, certain sample types continue to present significant challenges that can compromise sequencing results. Pedro Echave, Senior Manager of the Global Business Segment at Revvity, explained that the success of a sequencing experiment ultimately depends on the amount and integrity of the nucleic acid template (RNA or DNA) obtained from a sample. “The better the quality of the nucleic acid isolated...
            03-22-2024, 06:39 AM
          • seqadmin
            Techniques and Challenges in Conservation Genomics
            by seqadmin



            The field of conservation genomics centers on applying genomics technologies in support of conservation efforts and the preservation of biodiversity. This article features interviews with two researchers who showcase their innovative work and highlight the current state and future of conservation genomics.

            Avian Conservation
            Matthew DeSaix, a recent doctoral graduate from Kristen Ruegg’s lab at The University of Colorado, shared that most of his research...
            03-08-2024, 10:41 AM

          ad_right_rmr

          Collapse

          News

          Collapse

          Topics Statistics Last Post
          Started by seqadmin, Yesterday, 06:37 PM
          0 responses
          7 views
          0 likes
          Last Post seqadmin  
          Started by seqadmin, Yesterday, 06:07 PM
          0 responses
          7 views
          0 likes
          Last Post seqadmin  
          Started by seqadmin, 03-22-2024, 10:03 AM
          0 responses
          49 views
          0 likes
          Last Post seqadmin  
          Started by seqadmin, 03-21-2024, 07:32 AM
          0 responses
          66 views
          0 likes
          Last Post seqadmin  
          Working...
          X