The NCBI Map Viewer has the latest pig genome build and shows the
locations of all the genes. I would like to download this gene
annotation so I can load it into my own GBrowse genome browser.
So I need the NCBI gene annotation for the latest pig genome build in
gff3 format, and the way to do it seems to be to download an asn.1
file from NCBI, convert it to genbank format, and then use the bioperl
script bp_genbank2gff3.pl to convert from genbank to gff3.
I downloaded the gene annotation for the pig genome from the NCBI ftp site at
ftp://ftp.ncbi.nlm.nih.gov/gene/DATA..._scrofa.ags.gz
I downloaded the asn2gb conversion program from
ftp://ftp.ncbi.nlm.nih.gov/asn1-conv...latform/linux/
I run ./linux.asn2gb -i Sus_scrofa.ags -b T
and get the error "Asn io_failure for input file 'Sus_scrofa.ags'"
I've tried all the options for the -a and -t flags without luck.
I'm able to convert the Sus_scrofa.ags file to xml format using the
gene2xml program, but I don't know of any tool that can convert from
XML to gff3.
I downloaded a genbank format file of pig genes from
ftp://ftp.ncbi.nlm.nih.gov/genomes/S...RNA/rna.gbk.gz but the
file doesn't give chromosome coordinates for the genes, so I can't
make a gff3 file out of it.
Any pointers on how to use the asn tools properly, or how to get NCBI
annotation in gff format in general, would be much appreciated.
Thanks
-John
locations of all the genes. I would like to download this gene
annotation so I can load it into my own GBrowse genome browser.
So I need the NCBI gene annotation for the latest pig genome build in
gff3 format, and the way to do it seems to be to download an asn.1
file from NCBI, convert it to genbank format, and then use the bioperl
script bp_genbank2gff3.pl to convert from genbank to gff3.
I downloaded the gene annotation for the pig genome from the NCBI ftp site at
ftp://ftp.ncbi.nlm.nih.gov/gene/DATA..._scrofa.ags.gz
I downloaded the asn2gb conversion program from
ftp://ftp.ncbi.nlm.nih.gov/asn1-conv...latform/linux/
I run ./linux.asn2gb -i Sus_scrofa.ags -b T
and get the error "Asn io_failure for input file 'Sus_scrofa.ags'"
I've tried all the options for the -a and -t flags without luck.
I'm able to convert the Sus_scrofa.ags file to xml format using the
gene2xml program, but I don't know of any tool that can convert from
XML to gff3.
I downloaded a genbank format file of pig genes from
ftp://ftp.ncbi.nlm.nih.gov/genomes/S...RNA/rna.gbk.gz but the
file doesn't give chromosome coordinates for the genes, so I can't
make a gff3 file out of it.
Any pointers on how to use the asn tools properly, or how to get NCBI
annotation in gff format in general, would be much appreciated.
Thanks
-John
Comment