  • Combining BLAST nt and local database

    Hi all,

    I've made a custom local BLAST database and I'm trying to 'combine' it with the BLAST nt database using blastdb_aliastool from ncbi-blast-2.9.0+, but I get this error:

    "BLAST Database error: BLASTDB alias file creation failed. Some referenced files may be missing"

    Here's what I've done:

    In the directory nt, on our Linux computing cluster, I downloaded the most recent version of the BLAST nt database and unpacked it:

    $ wget "ftp://ftp.ncbi.nlm.nih.gov/blast/db/nt.??.tar.gz"

    $ for f in *.tar.gz; do tar -zxvf "$f"; rm "$f"; done

    This completes with no errors and produces what seems to be the correct number of files (10 files for each nt.* part of the database, plus nt.nal, and taxdb.btd and taxdb.bti). All parts of the nt database are present (nt.00 to nt.85) and all these parts are listed in the nt.nal file.
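
    The check that every part listed in nt.nal is really on disk can be sketched as below. This is a minimal sketch, not part of BLAST+: the function name missing_volumes is hypothetical, and it assumes the alias file has a DBLIST line of space-separated, optionally quoted volume names and that each volume has a .nsq sequence file next to the alias file.

    ```shell
    # Print every volume named on the DBLIST line of an alias file (.nal)
    # whose .nsq file is not found in the same directory as the alias file.
    # Assumption: volume names are space-separated and optionally quoted.
    missing_volumes() {
        nal_file=$1
        db_dir=$(dirname "$nal_file")
        grep '^DBLIST' "$nal_file" | sed 's/^DBLIST//' | tr -d '"' | tr ' ' '\n' |
        while read -r vol; do
            [ -z "$vol" ] && continue
            [ -e "$db_dir/$vol.nsq" ] || echo "$vol"
        done
    }

    # Usage (prints nothing when all volumes are present):
    #   missing_volumes nt.nal
    ```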

    Next, I make my local database:

    I copied (scp) my FASTA file and my taxid file to the nt folder, so everything is in the same folder.

    $ makeblastdb -in CocoaPESTLocal.fasta -parse_seqids -taxid_map CocoaPESTLocal_TaxonomyIDs.txt -blastdb_version 5 -title "Local database" -dbtype nucl -out loc

    Output:
    Building a new DB, current time: 12/24/2019 11:54:52
    New DB name: /ddn/data/mdzm87/BLASTdatabases/nt
    New DB title: Local database
    Sequence type: Nucleotide
    Keep MBits: T
    Maximum file size: 1000000000B
    Adding sequences from FASTA; added 44 sequences in 0.0142331 seconds.

    And nine files are created.
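
    To cross-check the "added 44 sequences" line, the FASTA headers can be counted directly. A minimal sketch: count_fasta_records is a hypothetical helper name, and it assumes every record starts with '>' in the first column.

    ```shell
    # Count FASTA records by counting header lines (lines starting with '>').
    # Note: grep -c exits non-zero when the count is 0.
    count_fasta_records() {
        grep -c '^>' "$1"
    }

    # Usage; the number should match the count reported by makeblastdb
    # if every record was loaded:
    #   count_fasta_records CocoaPESTLocal.fasta
    ```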

    Next, I try to 'combine' the databases using the blastdb_aliastool:

    $ blastdb_aliastool -dblist "loc nt" -dbtype nucl -out comb -title "Combined database"

    BLAST Database error: BLASTDB alias file creation failed. Some referenced files may be missing


    I also tried different variants of the command:

    $ blastdb_aliastool -dblist "loc nt.00 nt.01 nt.02 nt.03 nt.04 nt.05 nt.06 nt.07 nt.08 nt.09 nt.10 nt.11 nt.12 nt.13 nt.14 nt.15 nt.16 nt.17 nt.18 nt.19 nt.20 nt.21 nt.22 nt.23 nt.24 nt.25 nt.26 nt.27 nt.28 nt.29 nt.30 nt.31 nt.32 nt.33 nt.34 nt.35 nt.36 nt.37 nt.38 nt.39 nt.40 nt.41 nt.42 nt.43 nt.44 nt.45 nt.46 nt.47 nt.48 nt.49 nt.50 nt.51 nt.52 nt.53 nt.54 nt.55 nt.56 nt.57 nt.58 nt.59 nt.60 nt.61 nt.62 nt.63 nt.64 nt.65 nt.66 nt.67 nt.68 nt.69 nt.70 nt.71 nt.72 nt.73 nt.74 nt.75 nt.76 nt.77 nt.78 nt.79 nt.80 nt.81 nt.82 nt.83 nt.84 nt.85" -dbtype nucl -out comb -title "Combined database"

    and

    $ blastdb_aliastool -dblist "loc nt.nal" -dbtype nucl -out comb -title "Combined database"

    Both variants give the same error.
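
    One way to narrow this down is to ask BLAST+ whether each name in the -dblist can be opened on its own. A sketch under assumptions: check_dbs is a hypothetical helper, it relies on blastdbcmd (shipped with BLAST+) and its -info option, and it has not been run against this setup.

    ```shell
    # For each database name given, try to open it with blastdbcmd and
    # report whether that worked. Output of blastdbcmd itself is discarded.
    check_dbs() {
        for db in "$@"; do
            if blastdbcmd -db "$db" -dbtype nucl -info >/dev/null 2>&1; then
                echo "ok: $db"
            else
                echo "FAILED: $db"
            fi
        done
    }

    # Usage, run from the directory that holds the databases:
    #   check_dbs loc nt
    ```

    A name that fails here on its own would point to a problem with that database rather than with the alias step.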

    However, if I make two copies of my local reference database (named loc and local) and combine those, the command succeeds:

    $ blastdb_aliastool -dblist "local loc" -dbtype nucl -out comb -title "Combined database"

    Created nucleotide BLAST (alias) database comb with 88 sequences

    So the problem seems to be specific to 'combining' the BLAST nt database with my local database.

    The only difference I notice is that the BLAST nt database uses different file extensions for some of its files than my local database does:

    nt.00.nhd
    nt.00.nhi
    nt.00.nhr
    nt.00.nin
    nt.00.nnd
    nt.00.nni
    nt.00.nog
    nt.00.nsd
    nt.00.nsi
    nt.00.nsq
    ....

    loc.ndb
    loc.nhr
    loc.nin
    loc.nog
    loc.nos
    loc.not
    loc.nsq
    loc.ntf
    loc.nto
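
    The two listings can be compared more systematically by printing the set of extensions found under each database prefix. This is a minimal sketch with a hypothetical helper name, db_extensions. A possible lead, stated as an assumption rather than a confirmed diagnosis: loc was built with -blastdb_version 5, and the .ndb, .nos, .not, .ntf and .nto files appear only on that side, so the two databases may be in different format versions.

    ```shell
    # Print the distinct final file extensions of all files named PREFIX.*,
    # one per line, sorted. Prints nothing if no such file exists.
    db_extensions() {
        prefix=$1
        for f in "$prefix".*; do
            [ -e "$f" ] || continue
            echo "${f##*.}"
        done | sort -u
    }

    # Usage, run from the directory that holds the databases:
    #   db_extensions loc
    #   db_extensions nt
    ```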


    I've consulted the BLAST+ manual and done a lot of Googling, but I can't figure it out. Does anyone have any ideas?

    Thanks,
    Andreanna
