**nickloman** · 11-17-2012, 12:50 PM

Hey

Not a full solution, but MEGAN provides files which map GIs to taxon IDs for nt and nr via this link: http://ab.inf.uni-tuebingen.de/data/...d/welcome.html

Hope that helps

**Richard Finney** · 11-17-2012, 01:06 PM

Easiest method to get taxonomy ids ...
Just check out this directory: ftp://ftp.ncbi.nih.gov/pub/taxonomy/
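
Once you have a GI-to-taxon mapping like the ones mentioned above, picking out the GIs for the taxa you care about is a simple join. Here is a rough sketch, not a tested recipe: it assumes a two-column, whitespace-separated mapping file (GI first, taxon ID second) and a plain list of wanted taxon IDs, one per line. The function name `gis_for_taxids` and both file layouts are my assumptions, so check the actual files before relying on it.

```shell
# Hypothetical helper: print every GI whose taxon ID is in a wanted list.
#   $1 = file of wanted taxon IDs, one per line (assumed layout)
#   $2 = mapping file with "GI <whitespace> taxid" per line (assumed layout)
gis_for_taxids() {
    taxids_file="$1"
    mapping_file="$2"
    # First file: remember each wanted taxon ID.
    # Second file: print column 1 (GI) when column 2 (taxid) was remembered.
    awk 'NR == FNR { wanted[$1] = 1; next }
         ($2 in wanted) { print $1 }' "$taxids_file" "$mapping_file"
}

# Example call (file names are placeholders):
#   gis_for_taxids wanted_taxids.txt gi_to_taxid.tsv > wanted_gis.txt
```

Note this only matches exact taxon IDs, so the wanted list has to contain every species or strain ID you are after, not just a top-level ID. It also expects the wanted list to be non-empty.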

________________
If you want bacterial and viral genomes in FASTA format ...

Check out the documentation on NCBI file name extensions here:

http://defindit.com/readme_files/ncbi_file_extension_format.html

It gives a brief description of the file extensions and file formats found at the National Center for Biotechnology Information.

You can download the data from NCBI by FTP here:
ftp://ftp.ncbi.nlm.nih.gov/genomes/Bacteria/
Look for the all* files. The ftp://ftp.ncbi.nlm.nih.gov/genomes/B...all.fna.tar.gz file should contain all bacterial genomes.

Viruses are here: ftp://ftp.ncbi.nlm.nih.gov/genomes/Viruses/

"WGS bacteria OLD" is thereabouts, just look around. Draft genomes are thereabouts, too.

_____

Alternate way to get taxon IDs, for example for bacteria ...

You can get the "all rpt" file via wget:
wget ftp://ftp.ncbi.nlm.nih.gov/genomes/B...all.rpt.tar.gz
Unzip and untar.

Run the command
find . -name '*.rpt' -exec grep Taxid {} \; | sort | uniq
There you go.
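
If you only want the bare numbers rather than the whole matching lines, something like the following might do. It is a minimal sketch that assumes each report carries a line of the form `Taxid: 511145`; I have not checked that against every .rpt file, and the function name `list_taxids` is just made up for illustration.

```shell
# Hypothetical helper: print the unique numeric taxon IDs found in all
# *.rpt files under a directory (defaults to the current directory).
# Assumes report lines look like "Taxid: 511145" (an assumption).
list_taxids() {
    dir="${1:-.}"
    # sed prints only the digits that follow "Taxid:" and drops all other lines;
    # sort -n -u orders the IDs numerically and removes duplicates.
    find "$dir" -name '*.rpt' -exec sed -n \
        's/^.*Taxid:[[:space:]]*\([0-9][0-9]*\).*$/\1/p' {} + \
        | sort -n -u
}

# Example call from the directory where the tarball was unpacked:
#   list_taxids . > bacterial_taxids.txt
```

Using `{} +` instead of `{} \;` hands many files to one sed process, which is noticeably quicker when there are thousands of reports.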

**kga1978** · 11-17-2012, 03:32 PM

Wow, thanks so much guys - this was incredibly helpful! I've got it all covered now

Extracting all microbial sequences from NT