Unconfigured Ad

Collapse
X
 
  • Filter
  • Time
  • Show
Clear All
new posts
  • pico
    Junior Member
    • Jan 2013
    • 4

    Problem with making blast database

    Hello,

    I have a problem with makeblastdb in command line. I'm trying to make a database of a multi-fasta file containing 200 genes but the job is taking too much time, i have leave it for the week end but still no database created.

    My file is in fasta format, i have checked it many times. I don't know where the problem can be, do you have a solution for that or i'm maybe missing something...

    command line : makeblastdb -in genes.txt -dbtype nucl -out Db_genes.txt

    Building a new DB, current time: 03/25/2013 21:06:33
    New DB name: Db_genes.txt
    New DB title: genes.txt
    Sequence type: Nucleotide
    Keep Linkouts: T
    Keep MBits: T
    Maximum file size: 1000000000B
    .......it never finish the job...

    Thank you very much
  • A.N.Other
    Member
    • Feb 2012
    • 26

    #2
    I would normally use:

    Code:
    makeblastdb -in <in_fa_file> -input_type fasta -title <title> -out <out_file>
    -input_type defaults to fasta anyway, so the only issue I can see with your command is that you have '.txt' on the -out file name. makeblastdb will automatically give the extensions .nhr, .nin and .nsq to the three files it makes, so this is not necessary. Not sure if it's causing the problem, though. Make sure you've changed directory to the location would want the db created in before you run the script and make sure you've got the relevant permissions to do what you need.

    Comment

    • GenoMax
      Senior Member
      • Feb 2008
      • 7142

      #3
      With just 200 genes this should not be taking a week. What OS are you doing this on? Did you create your multi-fasta file on a PC and then are trying to create the indexes on a unix machine?

      Comment

      • pico
        Junior Member
        • Jan 2013
        • 4

        #4
        i'm on a mac and always worked on it. I think i have a problem with my multi fasta file but i don't know what it can be...

        Comment

        • danwiththeplan
          Member
          • Sep 2011
          • 72

          #5
          For 200 genes it should be done in seconds. Maybe check that there is no odd hidden line ending or wrapping on your fasta lines, although I don't know if this affects makeblastdb.
          I.e. in a text editor, each fasta line should be the header and the sequence below it, both only on one line, with no extra line endings or word wrapping. Use a text editor that displays hidden characters (I use Geany on linux, not sure what works on mac). Check permissions as the other guy said.

          Comment

          • Kennels
            Senior Member
            • Feb 2011
            • 149

            #6
            can you paste an example of a gene with the header and sequence here?
            It could be a problem with the header format, or as mentioned 'new line' format issue between different OS's.
            Also, I'd use '-parse_seqids' options too.

            Comment

            • Wallysb01
              Senior Member
              • Feb 2011
              • 286

              #7
              Some text editors will input strange line brake characters. Look at your file using the "more" or "less" command, this will show the file how makeblastdb will see it.

              Comment

              • pico
                Junior Member
                • Jan 2013
                • 4

                #8
                Hello,

                Thank you for all your answers, there was a line break in the middle of my text file!

                Comment

                Latest Articles

                Collapse

                • SEQadmin2
                  From Collection to Sequencing: Why Sample Preparation and Preservation Define Sequencing Data
                  by SEQadmin2


                  Data variability is still an issue in sequencing technologies despite the advances in reproducibility and accuracy of these platforms. But the problem does not originate in the sequencing itself, but in the previous steps, before the sample reaches the sequencer.


                  The first step is collection, followed by preservation and sample preparation for analysis. Most scientists overlook those steps, but not being careful might just be skewing the experiment’s results.
                  ...
                  06-02-2026, 10:05 AM
                • SEQadmin2
                  Single-Cell Sequencing at an Inflection Point: Early Impacts of New Platforms and Emerging Trends
                  by SEQadmin2


                  With the launch of new single-cell sequencing platforms in 2026, the field stands at an exciting inflection point. This article surveys the most impactful advances in the field and discusses how they’re reshaping research in cancer, immunology, and beyond.


                  Introduction

                  Single-cell sequencing technologies have undergone remarkable advances over the past decade, transitioning from low-throughput experimental approaches to highly scalable platforms capable of...
                  05-22-2026, 06:42 AM
                • SEQadmin2
                  Environmental Genomics in the Age of NGS: From Microbes to Conservation Strategies
                  by SEQadmin2

                  Studying ecosystems means dealing with complex, multi-species communities that are hard to observe at scale. This complexity, however, hides many important questions to be answered, from how biogeochemical cycles work and how climate change can affect species distribution to how conservation strategies can work best.


                  Genomics, particularly since the expansion of NGS, has transformed ecosystem ecology. By sequencing environmental DNA, we can now assess biodiversity without direct...
                  05-06-2026, 09:04 AM

                ad_right_rmr

                Collapse

                News

                Collapse

                Topics Statistics Last Post
                Started by SEQadmin2, Today, 08:59 AM
                0 responses
                7 views
                0 reactions
                Last Post SEQadmin2  
                Started by SEQadmin2, 06-02-2026, 12:03 PM
                0 responses
                21 views
                0 reactions
                Last Post SEQadmin2  
                Started by SEQadmin2, 06-02-2026, 11:40 AM
                0 responses
                14 views
                0 reactions
                Last Post SEQadmin2  
                Started by SEQadmin2, 05-28-2026, 11:40 AM
                0 responses
                29 views
                0 reactions
                Last Post SEQadmin2  
                Working...