Unconfigured Ad

Collapse
X
 
  • Filter
  • Time
  • Show
Clear All
new posts
  • NGS_New_User
    Member
    • Sep 2012
    • 41

    Gene Ontology analysis tools

    Hi Everyone,

    I desperately need your expertise advise on this

    I have a assembled a genome de novo, and I would like to run gene ontology analysis on it. I have around 300k contigs. What software/tools would you advise me to try and use? I tried the free version of Blast2GO but it had limitations such that it could not work with long contigs.
    Does the Blast2GO pro override that limitation?
    What other GO tools would do a similar thing?

    Thanks in advance
  • maubp
    Peter (Biopython etc)
    • Jul 2009
    • 1544

    #2
    Normally you'd assign GO terms to genes/proteins (typically short), not entire genome contigs (typically long).

    You should look at gene finding, and perhaps automated annotation in general.

    Comment

    • Hel
      Member
      • Jul 2013
      • 24

      #3
      Hi NGS_New_User,

      I also have the same question. I have a de novo assembled transcriptome to analyze in gene ontology terms. Please maubp, could you explain how to get genes/proteins from our data? Because I only have contigs (and unassembled reads)..

      Thanks in advance!
      Last edited by Hel; 12-10-2013, 04:22 AM.

      Comment

      • rhinoceros
        Senior Member
        • Apr 2013
        • 372

        #4
        Originally posted by Hel View Post
        Hi NGS_New_User,

        I also have the same question. I have a de novo assembled transcriptome to analyze in gene ontology terms. Please maubp, could you explain how to get genes/proteins from our data? Because I only have contigs (and unassembled reads)..

        Thank in advance!
        Why not:

        1. Predict proteins
        2. Blastp against nr (tabular output)
        3. Map to GO with this file
        4. Sort for best hits with a GO match
        Last edited by rhinoceros; 12-10-2013, 04:14 AM.
        savetherhino.org

        Comment

        • sphil
          Senior Member
          • Apr 2010
          • 192

          #5
          Originally posted by rhinoceros View Post
          Why not:

          1. Predict proteins
          2. Blastp against nr (tabular output)
          or
          2. Blastp against nr (XML output)
          3. load into Blast2GO
          4. do you analysis...

          prediction of proteins you can go with augustus but there are several other programs which are capable.

          Comment

          • mcnelson.phd
            Senior Member
            • Jul 2011
            • 162

            #6
            Originally posted by Hel View Post
            Hi NGS_New_User,

            I also have the same question. I have a de novo assembled transcriptome to analyze in gene ontology terms. Please maubp, could you explain how to get genes/proteins from our data? Because I only have contigs (and unassembled reads)..

            Thanks in advance!
            You'll need to get the nucleotide sequence of all possible ORFs that are found in your contigs. To do that you can use Glimmer, GeneMark, or a few other ORF callers although the first two are probably the most popular. Then you can get a multi-fasta of all the ORFs and run those through blast2GO.

            Comment

            • Hel
              Member
              • Jul 2013
              • 24

              #7
              Thanks to all . I'm sorry but I don´t understand everything.

              rhinoceros) Which tool to predict proteins?

              sphill) Well, Blast2GO already do the blast step, isn't it?

              mcnelson.phd) My contigs came from RNA-seq, so all my sequences have been expressed. Do you mean that I have to select the portion of the contigs that is expressed? This confuses me.

              Thanks in advance!

              Comment

              • maubp
                Peter (Biopython etc)
                • Jul 2009
                • 1544

                #8
                Originally posted by Hel View Post
                sphill) Well, Blast2GO already do the blast step, isn't it?
                Doing the BLAST online at the NCBI from within the Blast2GO tool is very slow. It is much faster to run the BLAST locally on a cluster, and import the BLAST results into Blast2GO for the annotation mapping step.
                Last edited by maubp; 12-12-2013, 06:14 AM. Reason: typo

                Comment

                • sphil
                  Senior Member
                  • Apr 2010
                  • 192

                  #9
                  Originally posted by maubp View Post
                  Doing the BLAST online at he NCBI from within the Blast2GO tool is very slow. It is much faster to run the BLAST locally on a cluster, and import the BLAST results into Blast2GO for the annotation mapping step.
                  Yep that is exactly why i should go with local blast and then blast2go.

                  Comment

                  Latest Articles

                  Collapse

                  • SEQadmin2
                    From Collection to Sequencing: Why Sample Preparation and Preservation Define Sequencing Data
                    by SEQadmin2


                    Data variability is still an issue in sequencing technologies despite the advances in reproducibility and accuracy of these platforms. But the problem does not originate in the sequencing itself, but in the previous steps, before the sample reaches the sequencer.


                    The first step is collection, followed by preservation and sample preparation for analysis. Most scientists overlook those steps, but not being careful might just be skewing the experiment’s results.
                    ...
                    06-02-2026, 10:05 AM
                  • SEQadmin2
                    Single-Cell Sequencing at an Inflection Point: Early Impacts of New Platforms and Emerging Trends
                    by SEQadmin2


                    With the launch of new single-cell sequencing platforms in 2026, the field stands at an exciting inflection point. This article surveys the most impactful advances in the field and discusses how they’re reshaping research in cancer, immunology, and beyond.


                    Introduction

                    Single-cell sequencing technologies have undergone remarkable advances over the past decade, transitioning from low-throughput experimental approaches to highly scalable platforms capable of...
                    05-22-2026, 06:42 AM
                  • SEQadmin2
                    Environmental Genomics in the Age of NGS: From Microbes to Conservation Strategies
                    by SEQadmin2

                    Studying ecosystems means dealing with complex, multi-species communities that are hard to observe at scale. This complexity, however, hides many important questions to be answered, from how biogeochemical cycles work and how climate change can affect species distribution to how conservation strategies can work best.


                    Genomics, particularly since the expansion of NGS, has transformed ecosystem ecology. By sequencing environmental DNA, we can now assess biodiversity without direct...
                    05-06-2026, 09:04 AM

                  ad_right_rmr

                  Collapse

                  News

                  Collapse

                  Topics Statistics Last Post
                  Started by SEQadmin2, Today, 08:59 AM
                  0 responses
                  10 views
                  0 reactions
                  Last Post SEQadmin2  
                  Started by SEQadmin2, 06-02-2026, 12:03 PM
                  0 responses
                  21 views
                  0 reactions
                  Last Post SEQadmin2  
                  Started by SEQadmin2, 06-02-2026, 11:40 AM
                  0 responses
                  17 views
                  0 reactions
                  Last Post SEQadmin2  
                  Started by SEQadmin2, 05-28-2026, 11:40 AM
                  0 responses
                  31 views
                  0 reactions
                  Last Post SEQadmin2  
                  Working...