Unconfigured Ad

Collapse
X
 
  • Filter
  • Time
  • Show
Clear All
new posts
  • Ramet
    Junior Member
    • Sep 2010
    • 9

    Convert GO numbers to GO terms

    Hi all!

    Which programme is most suitable for converting a list of GO numbers to their corresponding GO terms (including assignment to MF, CC, BP and lvl)?

    I have something like:
    73562
    01276
    89003
    00271
    ...
    ...
    ...

    And I need an output like
    73562 --> MF: lvl3: oxygen transporter activity
    01276 --> CC: lvl2: mitochondrial membrane
    89003 --> CC: lvl3: inner mitochondrial membrane
    00271 --> BP: lvl4: oxygen detoxification
    ...
    ...
    ...

    Thanks for hints!
  • svl
    Member
    • Sep 2009
    • 43

    #2
    Hi Ramet,

    At this link you can find txt files with the info you need, although I'm not sure what the "lvl" is...
    The Gene Ontology (GO) project is a major bioinformatics initiative to develop a computational representation of our evolving knowledge of how genes encode biological functions at the molecular, cellular and tissue system levels.


    You can write a small perl script to parse and combine this with your own file.

    -Stef

    Comment

    • robertorun
      Member
      • Nov 2009
      • 22

      #3
      I think, the key is the caculation of level!

      Comment

      • malachig
        Senior Member
        • Aug 2010
        • 117

        #4
        Another option is to create a local instance of the MySQL database and use an SQL query to map ids to descriptions. Instructions for doing this are here

        Comment

        • malachig
          Senior Member
          • Aug 2010
          • 117

          #5
          Yes calculating the level (or 'depth') of a term is the main challenge to obtain the information 'Ramet' seeks. A large number of MySQL queries of the GO schema are discussed at the following URL. Included on this page are various mentions of the difficulty and caveats associated with depth. Each term can have multiple associations to other terms. This means that one term may have more than one depth depending on the path you take through the tree. You are therefore faced with the problem of deciding what depth to use (average, minimum, maximum?). This all leads to the questions: Do you need the level? Does it really mean what you hope it does?

          From the GO documentation: "The depth of a node or distance between nodes is not necessarily biologically meaningful. Other more advanced queries are required to get biologically meaningful information-theory based distances between terms "


          Good luck!

          Comment

          • simonandrews
            Simon Andrews
            • May 2009
            • 870

            #6
            There's quite a nice Perl API to the GO databases at:



            The example code at the top pretty much does what you're after.

            Comment

            • Ramet
              Junior Member
              • Sep 2010
              • 9

              #7
              Thanks to everyone!

              You have convinced me that I maybe do not need the level (depth). Thanks for that conceptual help!

              And parsing the textfiles from http://www.geneontology.org/GO.downloads.files.shtml with a perl script seems to be the solution for my problem.
              Shame on me to have missed them.

              I'll later on will have a look on the MySQL database solution and then decide which way to go.

              Again, thank you very much!

              Comment

              Latest Articles

              Collapse

              • SEQadmin2
                From Collection to Sequencing: Why Sample Preparation and Preservation Define Sequencing Data
                by SEQadmin2


                Data variability is still an issue in sequencing technologies despite the advances in reproducibility and accuracy of these platforms. But the problem does not originate in the sequencing itself, but in the previous steps, before the sample reaches the sequencer.


                The first step is collection, followed by preservation and sample preparation for analysis. Most scientists overlook those steps, but not being careful might just be skewing the experiment’s results.
                ...
                06-02-2026, 10:05 AM
              • SEQadmin2
                Single-Cell Sequencing at an Inflection Point: Early Impacts of New Platforms and Emerging Trends
                by SEQadmin2


                With the launch of new single-cell sequencing platforms in 2026, the field stands at an exciting inflection point. This article surveys the most impactful advances in the field and discusses how they’re reshaping research in cancer, immunology, and beyond.


                Introduction

                Single-cell sequencing technologies have undergone remarkable advances over the past decade, transitioning from low-throughput experimental approaches to highly scalable platforms capable of...
                05-22-2026, 06:42 AM
              • SEQadmin2
                Environmental Genomics in the Age of NGS: From Microbes to Conservation Strategies
                by SEQadmin2

                Studying ecosystems means dealing with complex, multi-species communities that are hard to observe at scale. This complexity, however, hides many important questions to be answered, from how biogeochemical cycles work and how climate change can affect species distribution to how conservation strategies can work best.


                Genomics, particularly since the expansion of NGS, has transformed ecosystem ecology. By sequencing environmental DNA, we can now assess biodiversity without direct...
                05-06-2026, 09:04 AM

              ad_right_rmr

              Collapse

              News

              Collapse

              Topics Statistics Last Post
              Started by SEQadmin2, 06-02-2026, 12:03 PM
              0 responses
              19 views
              0 reactions
              Last Post SEQadmin2  
              Started by SEQadmin2, 06-02-2026, 11:40 AM
              0 responses
              14 views
              0 reactions
              Last Post SEQadmin2  
              Started by SEQadmin2, 05-28-2026, 11:40 AM
              0 responses
              29 views
              0 reactions
              Last Post SEQadmin2  
              Started by SEQadmin2, 05-26-2026, 10:12 AM
              0 responses
              31 views
              0 reactions
              Last Post SEQadmin2  
              Working...