Unconfigured Ad

Collapse
X
 
  • Filter
  • Time
  • Show
Clear All
new posts
  • GSviral
    Member
    • Dec 2014
    • 38

    Creating a Command Line BLAST+ Database

    Hey guys,

    I am relatively new to linux and command line and have a basic understanding of the basic commands in regards to the terminal.

    What I want to do is set up local BLAST+ databases so I can BLAST hundreds of sequences against the blastx or blastn databases.

    So far I have downloaded the latest databases, extracted them via gunzip, and copied them into my usr/local/bin folder. I believe this allows me to run BLAST applications from anywhere.

    I do not know where to go next. Do I have to format the blastx and blastn databases before I can blast my sequences? If so can any of you please point me in the right direction in how to do this?

    Any help is appreciated, thank you very much!

    Ryan.
  • GenoMax
    Senior Member
    • Feb 2008
    • 7142

    #2
    As long as you downloaded the pre-formatted databases from (ftp://ftp.ncbi.nlm.nih.gov/blast/db/) you can use them directly with blast+. You have also got the executables for your OS: ftp://ftp.ncbi.nlm.nih.gov/blast/exe...last+/LATEST/?

    Follow the command line manual for blast to start your analysis: http://www.ncbi.nlm.nih.gov/books/NBK1763/

    Keep realistic expectations if you are going to do the blast locally, specially if you have modest hardware (post specs if you need any specific comments on that). It will take time to blastx several hundred sequences against nr.

    Comment

    • GSviral
      Member
      • Dec 2014
      • 38

      #3
      Hello GenoMax,

      Thanks for the help. To begin I have downloaded the pre-formatted nt database.

      Once I unzip all the files associated with the nt database do I place them in the same directory as I have placed the BLAST+ executables? Once this is done I assume I will be able to run a search using one of my fasta files against the nt database.

      Thank you,

      Ryan.

      Comment

      • GenoMax
        Senior Member
        • Feb 2008
        • 7142

        #4
        Originally posted by GSviral View Post
        Hello GenoMax,

        Thanks for the help. To begin I have downloaded the pre-formatted nt database.

        Once I unzip all the files associated with the nt database do I place them in the same directory as I have placed the BLAST+ executables? Once this is done I assume I will be able to run a search using one of my fasta files against the nt database.

        Thank you,

        Ryan.
        You can put the database files where ever convenient. When you start your blast you will specify DB location by using the -db command line option and providing the path to the files. All the pieces for a set (numerically named files) need to be in the same directory and you only need to use the base name i.e. nr (do not include the numbers)

        Comment

        Latest Articles

        Collapse

        • SEQadmin2
          Nine Things a Sample Prep Scientist Thinks About Before Sequencing
          by SEQadmin2


          I’m not a sequencing expert. I’m a purification scientist who uses NGS to evaluate workflows my group develops. With this perspective, we think about the sample first and the NGS workflow second. The sequencer is an exceptionally honest reporter, but it can only report on what you give it, so whether you get clean, interpretable data from an NGS workflow is largely determined before you begin.

          Here are nine questions we think about, in roughly the order they matter, before...
          06-18-2026, 07:11 AM
        • SEQadmin2
          From Collection to Sequencing: Why Sample Preparation and Preservation Define Sequencing Data
          by SEQadmin2


          Data variability is still an issue in sequencing technologies despite the advances in reproducibility and accuracy of these platforms. But the problem does not originate in the sequencing itself, but in the previous steps, before the sample reaches the sequencer.


          The first step is collection, followed by preservation and sample preparation for analysis. Most scientists overlook those steps, but not being careful might just be skewing the experiment’s results.
          ...
          06-02-2026, 10:05 AM

        ad_right_rmr

        Collapse

        News

        Collapse

        Topics Statistics Last Post
        Started by SEQadmin2, 06-26-2026, 11:10 AM
        0 responses
        16 views
        0 reactions
        Last Post SEQadmin2  
        Started by SEQadmin2, 06-17-2026, 06:09 AM
        0 responses
        49 views
        0 reactions
        Last Post SEQadmin2  
        Started by SEQadmin2, 06-09-2026, 11:58 AM
        0 responses
        108 views
        0 reactions
        Last Post SEQadmin2  
        Started by SEQadmin2, 06-05-2026, 10:09 AM
        0 responses
        125 views
        0 reactions
        Last Post SEQadmin2  
        Working...