Unconfigured Ad

Collapse
X
 
  • Filter
  • Time
  • Show
Clear All
new posts
  • kalyankpy
    PostDoc
    • Mar 2010
    • 20

    Blast database creation : Error

    HI,

    I am new to Bioinformatics. I am creating a database with the fasta formatted sequences from high througput sequncing data (20-30 bp in length). While I create the blast database (version ncbi-blast-2.2.23), I notice an error which is not comprehendible to me. I request you to help me:

    Command used:
    ============
    $ makeblastdb -in puts2.fa -dbtype nucl -parse_seqids -mask_data puts2_mask.asnb -out puts2db -title "puts2_database"

    Error message:
    ===========
    Error: (803.7) Bad char in VisibleString: 9

    Do help me in solving this issue
  • bcanback
    Junior Member
    • Sep 2008
    • 1

    #2
    makeblastdb Bad char in VisibleString

    Hi!

    I had the same message today. In my case it is was due to a tab found in the id line. Hope your problem is equally simple.

    cheers

    Björn

    Comment

    • kalyankpy
      PostDoc
      • Mar 2010
      • 20

      #3
      Hi Bjorn,

      In my case there wasnt any problem in the data. Rather when I uninstalled the ....24 version of the blast++ and worked with 23 version things worked out pretty smooth. Later I noticed that there is a bug in that version.

      Comment

      • fishinabarrel
        Junior Member
        • Apr 2011
        • 6

        #4
        Had the same issue.

        Tab was causing the problem in the header.


        Simple fix in vi.
        From the colon :

        :%s/\t//g

        *replaces all tabs with nothing.

        Comment

        • kaem
          Junior Member
          • Mar 2012
          • 1

          #5
          Hi all

          I also create with makeblastdb 2.2.26+ - I received the same error message+
          Error: (803.7) Bad char in VisibleString: 9
          and deleted the tabs.

          however, making a database from Aedes Aedes_aegypti_Aaeg_1.2 i had two other error messages:

          Error: (803.7) Bad char in VisibleString: 195
          Error: (803.7) Bad char in VisibleString: 164

          Does anybody know what it means?
          thanks kaem

          Comment

          • PenelopeFudd
            Junior Member
            • Feb 2012
            • 8

            #6
            Apparently the number at the end of the message (9, 164, or 195) is the character code that it didn't like. So, the '9' is a tab character, and 164 and 195 are just random characters that together probably mean 'dash' in Unicode or something.

            To fix it, edit the fasta file with 'vi', search for all characters that aren't printable, and delete them or something. I used '/[^ -|]/' (match anything outside the range of space to vertical-bar) to find the bad character in my file. It was between two words in a header line, not in a sequence: "thioredoxin(?)peroxidase", so I replaced it with a dash.

            Hope this helps you, or anyone googling after you!

            Originally posted by kaem View Post
            Hi all

            I also create with makeblastdb 2.2.26+ - I received the same error message+
            Error: (803.7) Bad char in VisibleString: 9
            and deleted the tabs.

            however, making a database from Aedes Aedes_aegypti_Aaeg_1.2 i had two other error messages:

            Error: (803.7) Bad char in VisibleString: 195
            Error: (803.7) Bad char in VisibleString: 164

            Does anybody know what it means?
            thanks kaem

            Comment

            • Benoit_UMons
              Junior Member
              • Jun 2013
              • 1

              #7
              Hi all,

              How works the software with this error? They correct it?

              Because, my database is too big (13Go). I can't open it to correct the error manually.

              My bad character is 0X92 (i thing it's " , ").

              Can you help me please?

              Thank you very much,

              Benoit

              Comment

              • jjzieve
                Junior Member
                • Feb 2014
                • 3

                #8
                I'm new to seqanswers so forgive me for not knowing how to start a new thread. I'm having a strange issue... I'm running "makeblastdb -in seq.fsa -dbtye nucl" everything finishes fine (no error message) but there is no output (i.e. *.nin, *.nhr, *.nsq). I want to keep the outputs the same name as the inputs so I didn't specify "-out". I figured it might have saved in the $BLASTDB location but that isn't even set. One note is the fasta file is 16 GB, do I need to manually split it up and create *.nal or something? Thanks

                Comment

                • GenoMax
                  Senior Member
                  • Feb 2008
                  • 7142

                  #9
                  Originally posted by jjzieve View Post
                  I'm new to seqanswers so forgive me for not knowing how to start a new thread. I'm having a strange issue... I'm running "makeblastdb -in seq.fsa -dbtye nucl" everything finishes fine (no error message) but there is no output (i.e. *.nin, *.nhr, *.nsq). I want to keep the outputs the same name as the inputs so I didn't specify "-out". I figured it might have saved in the $BLASTDB location but that isn't even set. One note is the fasta file is 16 GB, do I need to manually split it up and create *.nal or something? Thanks
                  Starting new threads:

                  SeqAnswers.com --> "Forums" (from site navigation at top left) --> Choose an appropriate forum --> "New Thread" button at top left.

                  It is always a good idea to search the forum first with some keywords.

                  See this thread: http://seqanswers.com/forums/showthread.php?t=34619 and the link in the thread to NCBI blast command line manual.

                  How much memory do you have on this machine? For a 16GB sequence file it should take a while to make the database. How long did your job take?

                  Comment

                  • jjzieve
                    Junior Member
                    • Feb 2014
                    • 3

                    #10
                    We have 16 gigs of RAM, the job only took about 5 minutes. So... I'm thinking maybe the job quit with no errors? I have access to a high memory machine on a HPC I'll try running it there and let you know how that works out.

                    Comment

                    • GenoMax
                      Senior Member
                      • Feb 2008
                      • 7142

                      #11
                      In case you still have trouble try to capture the standard error/out messages to a file to see if that provides any additional info.

                      Comment

                      • jjzieve
                        Junior Member
                        • Feb 2014
                        • 3

                        #12
                        Worked like a charm on a high memory node running ubuntu 13.04. Do you think it would be worthwhile to put in a bug request to the blast people? That essentially, their were no errors reported for what was probably a stack overflow? The other system I was on was CentOS 6.4, btw.

                        Comment

                        Latest Articles

                        Collapse

                        • SEQadmin2
                          From Collection to Sequencing: Why Sample Preparation and Preservation Define Sequencing Data
                          by SEQadmin2


                          Data variability is still an issue in sequencing technologies despite the advances in reproducibility and accuracy of these platforms. But the problem does not originate in the sequencing itself, but in the previous steps, before the sample reaches the sequencer.


                          The first step is collection, followed by preservation and sample preparation for analysis. Most scientists overlook those steps, but not being careful might just be skewing the experiment’s results.
                          ...
                          06-02-2026, 10:05 AM
                        • SEQadmin2
                          Single-Cell Sequencing at an Inflection Point: Early Impacts of New Platforms and Emerging Trends
                          by SEQadmin2


                          With the launch of new single-cell sequencing platforms in 2026, the field stands at an exciting inflection point. This article surveys the most impactful advances in the field and discusses how they’re reshaping research in cancer, immunology, and beyond.


                          Introduction

                          Single-cell sequencing technologies have undergone remarkable advances over the past decade, transitioning from low-throughput experimental approaches to highly scalable platforms capable of...
                          05-22-2026, 06:42 AM
                        • SEQadmin2
                          Environmental Genomics in the Age of NGS: From Microbes to Conservation Strategies
                          by SEQadmin2

                          Studying ecosystems means dealing with complex, multi-species communities that are hard to observe at scale. This complexity, however, hides many important questions to be answered, from how biogeochemical cycles work and how climate change can affect species distribution to how conservation strategies can work best.


                          Genomics, particularly since the expansion of NGS, has transformed ecosystem ecology. By sequencing environmental DNA, we can now assess biodiversity without direct...
                          05-06-2026, 09:04 AM

                        ad_right_rmr

                        Collapse

                        News

                        Collapse

                        Topics Statistics Last Post
                        Started by SEQadmin2, Today, 08:59 AM
                        0 responses
                        10 views
                        0 reactions
                        Last Post SEQadmin2  
                        Started by SEQadmin2, 06-02-2026, 12:03 PM
                        0 responses
                        21 views
                        0 reactions
                        Last Post SEQadmin2  
                        Started by SEQadmin2, 06-02-2026, 11:40 AM
                        0 responses
                        17 views
                        0 reactions
                        Last Post SEQadmin2  
                        Started by SEQadmin2, 05-28-2026, 11:40 AM
                        0 responses
                        31 views
                        0 reactions
                        Last Post SEQadmin2  
                        Working...