**mastal** · 12-12-2013, 08:06 AM

Have you read the online documentation about the pre-formatted
BLAST databases?

ftp://ftp.ncbi.nlm.nih.gov/blast/documents/blastdb.html

The pre-formatted database files are ready to use once downloaded and
unpacked, so you don't need to run makeblastdb on them.

**LeightonP** · 12-12-2013, 09:24 AM

Originally posted by andreanna05:

> What I really want is just the sequences from one model organism, but I don't see a species-specific pre-formatted blast database for it.

Your best option then is to download the sequences from that model organism, and use makeblastdb to construct a BLAST database from them.

You can find the makeblastdb documentation here: http://nebc.nerc.ac.uk/bioinformatic...keblastdb.html
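
As a rough sketch of that second step, something like the following should work once the sequences are in a FASTA file. The function name `build_blast_db` and the file and database names are made up for illustration; `-in`, `-dbtype`, `-out` and `-parse_seqids` are standard makeblastdb options, but check the documentation for your BLAST+ version.

```shell
#!/usr/bin/env bash
# Sketch: wrap makeblastdb so a FASTA file becomes a local BLAST database.
# The function name and the file/database names are hypothetical placeholders.

build_blast_db() {
  local fasta="$1"    # input FASTA file
  local dbtype="$2"   # "nucl" for nucleotide or "prot" for protein sequences
  local name="$3"     # name of the database to create
  # -parse_seqids indexes the sequence IDs so blastdbcmd can retrieve entries later.
  makeblastdb -in "$fasta" -dbtype "$dbtype" -out "$name" -parse_seqids
}

# Only run the example when makeblastdb is installed and the input file exists.
if command -v makeblastdb >/dev/null 2>&1 && [ -f my_organism.fasta ]; then
  build_blast_db my_organism.fasta prot my_organism_db
fi
```

The resulting database can then be passed to the BLAST+ search programs with `-db my_organism_db`.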

**GenoMax** · 12-12-2013, 10:31 AM

Two possible options to consider if you are only interested in creating a db of sequences from a specific organism. In either case you can create your own blast db (makeblastdb) once you get the sequences together.

1. If you are not averse to downloading the files for the nr blast index (there are multiple), then you could use the blastdbcmd command to extract the sequences specific to your organism. Look for the section on extracting sequences using blastdbcmd in this manual: http://www.ncbi.nlm.nih.gov/books/NBK1763/

From NCBI:

Extract all human sequences from the nr database

Although one cannot select GIs by taxonomy from a database, a combination of unix command line tools will accomplish this:

$ blastdbcmd -db nr -entry all -outfmt "%g %T" | \
awk ' { if ($2 == 9606) { print $1 } } ' | \
blastdbcmd -db nr -entry_batch - -out human_sequences.txt
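
To adapt the NCBI example to another organism, only the taxonomy ID in the awk stage has to change. A minimal sketch of that stage as a reusable filter follows; the function name `filter_by_taxid` and the sample GI numbers are invented for illustration, and the input is assumed to be in the "GI taxid" layout produced by `-outfmt "%g %T"`.

```shell
#!/usr/bin/env bash
# Sketch: the awk stage of the NCBI pipeline with the taxid as a parameter.
# Reads "GI taxid" lines on stdin and prints the GI of every matching line.

filter_by_taxid() {
  local taxid="$1"
  awk -v taxid="$taxid" '$2 == taxid { print $1 }'
}

# Sample lines in the "%g %T" layout; the GI numbers are made up.
printf '%s\n' '111 9606' '222 10090' '333 9606' | filter_by_taxid 9606
# prints 111 and 333, one per line
```

In the full pipeline this function would sit between the two blastdbcmd calls in place of the hard-coded awk command.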

2. You could also use NCBI eutils to run a query that retrieves the sequence data you need. The manual for that is here: http://www.ncbi.nlm.nih.gov/books/NBK1058/
Application #3 in that manual (retrieving large datasets) may work.
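
For the eutils route, the usual pattern is an ESearch with the history server turned on, followed by EFetch calls in batches. The sketch below only builds the two request URLs; the function names, the choice of the protein database and the human taxid 9606 are assumptions for illustration, so adjust them to your organism and data type.

```shell
#!/usr/bin/env bash
# Sketch: build E-utilities URLs for fetching all records of one taxon.
# Function names, db=protein and the example taxid are illustrative assumptions.
EUTILS="https://eutils.ncbi.nlm.nih.gov/entrez/eutils"

# ESearch URL for a taxonomy ID; usehistory=y keeps the hit list on the server.
# %5B, %3A and %5D are the URL-encoded forms of "[", ":" and "]".
esearch_url() {
  local taxid="$1"
  printf '%s/esearch.fcgi?db=protein&term=txid%s%%5BOrganism%%3Aexp%%5D&usehistory=y\n' "$EUTILS" "$taxid"
}

# EFetch URL for one batch of FASTA records from the stored result set.
efetch_url() {
  local webenv="$1" query_key="$2" retstart="$3" retmax="$4"
  printf '%s/efetch.fcgi?db=protein&WebEnv=%s&query_key=%s&retstart=%s&retmax=%s&rettype=fasta&retmode=text\n' "$EUTILS" "$webenv" "$query_key" "$retstart" "$retmax"
}

# Show the search URL for the example taxid.
esearch_url 9606
```

The ESearch response contains a WebEnv string and a QueryKey value. Passing those to `efetch_url` while stepping `retstart` in increments of `retmax`, and downloading each URL with a tool such as curl, retrieves the full set as FASTA that can then go into makeblastdb.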

blast+ and pre-formatted databases

Comment

Comment

Comment

Latest Articles

ad_right_rmr

News