Header Leaderboard Ad

Collapse

Convert GO numbers to GO terms

Collapse

Announcement

Collapse
No announcement yet.
X
 
  • Filter
  • Time
  • Show
Clear All
new posts

  • Convert GO numbers to GO terms

    Hi all!

    Which programme is most suitable for converting a list of GO numbers to their corresponding GO terms (including assignment to MF, CC, BP and lvl)?

    I have something like:
    73562
    01276
    89003
    00271
    ...
    ...
    ...

    And I need an output like
    73562 --> MF: lvl3: oxygen transporter activity
    01276 --> CC: lvl2: mitochondrial membrane
    89003 --> CC: lvl3: inner mitochondrial membrane
    00271 --> BP: lvl4: oxygen detoxification
    ...
    ...
    ...

    Thanks for hints!

  • #2
    Hi Ramet,

    At this link you can find txt files with the info you need, although I'm not sure what the "lvl" is...
    The Gene Ontology (GO) project is a major bioinformatics initiative to develop a computational representation of our evolving knowledge of how genes encode biological functions at the molecular, cellular and tissue system levels.


    You can write a small perl script to parse and combine this with your own file.

    -Stef

    Comment


    • #3
      I think, the key is the caculation of level!

      Comment


      • #4
        Another option is to create a local instance of the MySQL database and use an SQL query to map ids to descriptions. Instructions for doing this are here
        http://www.geneontology.org/GO.datab...build-your-own

        Comment


        • #5
          Yes calculating the level (or 'depth') of a term is the main challenge to obtain the information 'Ramet' seeks. A large number of MySQL queries of the GO schema are discussed at the following URL. Included on this page are various mentions of the difficulty and caveats associated with depth. Each term can have multiple associations to other terms. This means that one term may have more than one depth depending on the path you take through the tree. You are therefore faced with the problem of deciding what depth to use (average, minimum, maximum?). This all leads to the questions: Do you need the level? Does it really mean what you hope it does?

          From the GO documentation: "The depth of a node or distance between nodes is not necessarily biologically meaningful. Other more advanced queries are required to get biologically meaningful information-theory based distances between terms "

          http://wiki.geneontology.org/index.p...terms_by_GO_ID
          Good luck!

          Comment


          • #6
            There's quite a nice Perl API to the GO databases at:

            http://search.cpan.org/~cmungall/go-perl/go-perl.pod

            The example code at the top pretty much does what you're after.

            Comment


            • #7
              Thanks to everyone!

              You have convinced me that I maybe do not need the level (depth). Thanks for that conceptual help!

              And parsing the textfiles from http://www.geneontology.org/GO.downloads.files.shtml with a perl script seems to be the solution for my problem.
              Shame on me to have missed them.

              I'll later on will have a look on the MySQL database solution and then decide which way to go.

              Again, thank you very much!

              Comment

              Latest Articles

              Collapse

              • seqadmin
                How RNA-Seq is Transforming Cancer Studies
                by seqadmin



                Cancer research has been transformed through numerous molecular techniques, with RNA sequencing (RNA-seq) playing a crucial role in understanding the complexity of the disease. Maša Ivin, Ph.D., Scientific Writer at Lexogen, and Yvonne Goepel Ph.D., Product Manager at Lexogen, remarked that “The high-throughput nature of RNA-seq allows for rapid profiling and deep exploration of the transcriptome.” They emphasized its indispensable role in cancer research, aiding in biomarker...
                09-07-2023, 11:15 PM
              • seqadmin
                Methods for Investigating the Transcriptome
                by seqadmin




                Ribonucleic acid (RNA) represents a range of diverse molecules that play a crucial role in many cellular processes. From serving as a protein template to regulating genes, the complex processes involving RNA make it a focal point of study for many scientists. This article will spotlight various methods scientists have developed to investigate different RNA subtypes and the broader transcriptome.

                Whole Transcriptome RNA-seq
                Whole transcriptome sequencing...
                08-31-2023, 11:07 AM

              ad_right_rmr

              Collapse

              News

              Collapse

              Topics Statistics Last Post
              Started by seqadmin, Yesterday, 06:18 AM
              0 responses
              5 views
              0 likes
              Last Post seqadmin  
              Started by seqadmin, 09-20-2023, 09:17 AM
              0 responses
              8 views
              0 likes
              Last Post seqadmin  
              Started by seqadmin, 09-19-2023, 09:23 AM
              0 responses
              25 views
              0 likes
              Last Post seqadmin  
              Started by seqadmin, 09-19-2023, 09:14 AM
              0 responses
              7 views
              0 likes
              Last Post seqadmin  
              Working...
              X