
  • How to download gene annotation from NCBI?

    The NCBI Map Viewer has the latest pig genome build and shows the
    locations of all the genes. I would like to download this gene
    annotation so I can load it into my own GBrowse genome browser.
    So I need the NCBI gene annotation for the latest pig genome build in
    GFF3 format, and the way to do it seems to be to download an ASN.1
    file from NCBI, convert it to GenBank format, and then use the BioPerl
    script bp_genbank2gff3.pl to convert from GenBank to GFF3.

    I downloaded the gene annotation for the pig genome from the NCBI ftp site at
    ftp://ftp.ncbi.nlm.nih.gov/gene/DATA..._scrofa.ags.gz

    I downloaded the asn2gb conversion program from
    ftp://ftp.ncbi.nlm.nih.gov/asn1-conv...latform/linux/

    I run ./linux.asn2gb -i Sus_scrofa.ags -b T
    and get the error "Asn io_failure for input file 'Sus_scrofa.ags'"
    I've tried all the options for the -a and -t flags without luck.

    I'm able to convert the Sus_scrofa.ags file to XML format using the
    gene2xml program, but I don't know of any tool that can convert from
    XML to GFF3.

    I downloaded a GenBank format file of pig genes from
    ftp://ftp.ncbi.nlm.nih.gov/genomes/S...RNA/rna.gbk.gz but the
    file doesn't give chromosome coordinates for the genes, so I can't
    make a GFF3 file out of it.

    Any pointers on how to use the asn tools properly, or how to get NCBI
    annotation in gff format in general, would be much appreciated.

    Thanks

    -John
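
The coordinate problem described in the post above follows from the GFF3 format itself: every feature line has nine tab-separated columns, including a sequence ID, 1-based start and end positions, and a strand. A minimal Python sketch of writing such lines, assuming gene records have already been extracted from some source. The function names, the `example` source label, and the sample coordinates are hypothetical and are not real pig annotation.

```python
# Characters that must be percent-encoded inside the GFF3 attributes
# column (column 9), because they are reserved or break the tab layout.
_RESERVED = {
    "%": "%25", ";": "%3B", "=": "%3D", "&": "%26", ",": "%2C",
    "\t": "%09", "\n": "%0A", "\r": "%0D",
}


def escape_attr(value):
    """Percent-encode reserved characters in an attribute tag or value."""
    return "".join(_RESERVED.get(ch, ch) for ch in value)


def gff3_line(seqid, start, end, strand, gene_id, name=None,
              source="example", feature_type="gene"):
    """Format one feature as a nine-column GFF3 line.

    seqid is assumed to contain no characters that need escaping.
    Score and phase are written as "." (not applicable).
    """
    if start < 1 or start > end:
        raise ValueError("GFF3 coordinates are 1-based with start <= end")
    if strand not in ("+", "-", ".", "?"):
        raise ValueError("strand must be one of + - . ?")
    attrs = ["ID=" + escape_attr(gene_id)]
    if name is not None:
        attrs.append("Name=" + escape_attr(name))
    columns = [seqid, source, feature_type, str(start), str(end),
               ".", strand, ".", ";".join(attrs)]
    return "\t".join(columns)


if __name__ == "__main__":
    # Made-up record, only to show the shape of the output.
    print("##gff-version 3")
    print(gff3_line("chr1", 1000, 2500, "+", "gene0001", name="hypothetical gene"))
```

Without a chromosome (or scaffold) name, start, end, and strand for each gene, columns 1, 4, 5, and 7 cannot be filled in, which is why an mRNA-only GenBank file is not enough.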

  • #2
    I managed to run:

    ./gene2xml.linux -i Sus_scrofa.ags -b T -c T

    This prints XML output. Strangely, Sus_scrofa.ags had to be gzipped and named Sus_scrofa.ags.gz.

    XML to GFF conversion should be fairly easy, but I do not know of a tool yet. You may check:



    • #3
      John,

      I'm running into the same problem that you had. The NCBI Sus scrofa genome FTP site provides .asn, .fa, .gbk, .gbs, and .mfa files for each chromosome (last updated 10-12-2011).

      Were you able to convert the .asn data to .gff3 or .gtf format for annotation? I'd be interested to hear the best method you found for generating the annotation file that corresponds to the most recent S. scrofa genome.

      Thanks in advance,
      jjw



      • #4
        I was not able to figure out how to convert any of the NCBI annotation data into a usable form. I sent an email to NCBI but didn't get a useful reply from them. Thankfully another group has generated a good gene build for Sscr10.2. As described here: http://animalgenome.org/pig/newsletter/No.110.html, you can download annotation at this site: http://gbi.agrsci.dk/pig/sscrofa10_2_annotation/
        Alternatively, Ensembl is running 10.2 through their pipeline and should have a gene build available in two or three months. If you can wait that long that would be another good alternative to NCBI's annotation.



        • #5
          Thanks for the quick reply, John.

          No doubt, you've saved me a lot of frustration. I appreciate it.

          jjw



          • #6
            Originally posted by jgarbe:
            "Any pointers on how to use the asn tools properly, or how to get NCBI annotation in gff format in general, would be much appreciated."

            The NCBI are currently revising all their GFF3 output (it hadn't been compliant with the standards), so this should be much easier now/soon.

            Try ftp://ftp.ncbi.nlm.nih.gov/genomes/Sus_scrofa/GFF/ for the NCBI RefSeq annotation of pig Sscrofa10.2
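
Once a GFF3 file has been downloaded from a directory like the one above, gene coordinates can be read with a few lines of Python. A minimal sketch, assuming a standard nine-column GFF3 file; `parse_gff3` and the sample lines are hypothetical illustrations, not NCBI data.

```python
from urllib.parse import unquote


def parse_gff3(lines, feature_type="gene"):
    """Yield a dict for each feature of the requested type.

    `lines` is any iterable of text lines, such as an open file handle.
    """
    for line in lines:
        line = line.rstrip("\r\n")
        if line.startswith("##FASTA"):
            break  # anything after this directive is sequence, not features
        if not line or line.startswith("#"):
            continue  # skip blank lines, comments, and other directives
        cols = line.split("\t")
        if len(cols) != 9 or cols[2] != feature_type:
            continue
        attrs = {}
        for pair in cols[8].split(";"):
            if "=" in pair:
                key, value = pair.split("=", 1)
                # Attribute tags and values are percent-encoded in GFF3.
                attrs[unquote(key)] = unquote(value)
        yield {
            "seqid": cols[0],
            "start": int(cols[3]),  # 1-based, inclusive
            "end": int(cols[4]),
            "strand": cols[6],
            "attributes": attrs,
        }


if __name__ == "__main__":
    # Made-up lines, only to show the expected input shape.
    sample = [
        "##gff-version 3",
        "chr1\tRefSeq\tgene\t100\t200\t.\t+\t.\tID=gene0;Name=ABC%3B1",
        "chr1\tRefSeq\tmRNA\t100\t200\t.\t+\t.\tID=rna0;Parent=gene0",
    ]
    for gene in parse_gff3(sample):
        print(gene["seqid"], gene["start"], gene["end"], gene["strand"],
              gene["attributes"].get("Name"))
```

For a gzipped download, the same function can be given a handle from `gzip.open(path, "rt")`, where `path` is whatever local filename was chosen.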



            • #7
              Thanks Peter,

              I'll take a look.

              jjw14



              • #8
                Hi all,
                I need a buffalo GFF or GFF3 file from NCBI, but I do not know how to get it.
                Could anyone help me?
                Thanks



                • #9
                  Bubalus bubalis? ftp://ftp.ncbi.nlm.nih.gov/genomes/Bubalus_bubalis/GFF/
                  Last edited by maubp; 08-12-2014, 02:00 AM.



                  • #10
                    Thanks Peter

