Unconfigured Ad

Collapse
X
 
  • Filter
  • Time
  • Show
Clear All
new posts
  • GSviral
    Member
    • Dec 2014
    • 38

    "Invalid byte in GI list"

    Hello everyone,

    I am currently working with a local BLAST nucleotide database. After getting it set up I am able to BLAST FASTA files without any bother.

    What I wanted to do was only search the nt database for viruses.

    To this end I have downloaded the virus accession list from NCBI.

    I try to use the following command:

    $ blastn -db nt -query sequence.fasta -num_alignments 10 -num_descriptions 10 -evalue 1e-6 -gilist viruses.nbr -num_threads 4 -out sequence.tab

    When I input this command I get a result saying "Invalid byte in GI list" and the command does not run. Can anyone help me out with this error message? Has there been a problem downloading the accession list file?

    Thanks for the help.
  • GenoMax
    Senior Member
    • Feb 2008
    • 7142

    #2
    Is there a header present in your gilist file? If it is there is try removing that.
    Last edited by GenoMax; 10-12-2015, 07:23 AM.

    Comment

    • GSviral
      Member
      • Dec 2014
      • 38

      #3
      Hi Genomax,

      Thanks for the advice. Yeah, there were header lines indicating accession number, organism name etc.

      Instead of using 'gilist' I ended up using 'seqidlist' which accepted my downloaded file. I am not sure if the results will differ using 'gilist' successfully but I will indeed try and remove the headers and re-run using 'gilist' to see if there are any differences.

      Cheers!

      Comment

      • GenoMax
        Senior Member
        • Feb 2008
        • 7142

        #4
        If you had "gi's" then it may be best to stick with gilist option. Not sure if the gi is equivalent to seqid.

        If you expect to do this often then consider sub-setting the viruses set permanently.

        Comment

        • GSviral
          Member
          • Dec 2014
          • 38

          #5
          I had limited success with the GI List option. I have realised that the virus taxa file I downloaded was for whole genomes and not partially sequenced genomes.

          I went back and downloaded the GenBank viral database in a FASTA file.

          From this I want to make a custom viral database to put my sequences through in order to speed up processing time and get the data I want without any bacterial sequences etc. however a new problem occurred.

          when typing $ makeblastdb -help in order to even just get the possible options I get a 'segmentation fault' error. Is this due to RAM limitations or problems with the BLAST+ application?

          Cheers.

          Comment

          • GenoMax
            Senior Member
            • Feb 2008
            • 7142

            #6
            What OS are you using?

            Comment

            • GSviral
              Member
              • Dec 2014
              • 38

              #7
              GenoMax,

              I am using GNOME CentOS 2.16.0.

              Outdated I am sure but my institution are picky about software unfortunately.

              Comment

              • GenoMax
                Senior Member
                • Feb 2008
                • 7142

                #8
                Looking at your blast command line you appear to be using the latest blast+ package. Can you confirm that? If blastn from that package worked then I am not sure why you are getting a seg fault with makeblastdb. Perhaps that is using a library that is missing from your system. Going to be hard to fix.

                BTW: Are you really using a 10+ year old OS (if I googled it right)?

                Comment

                • GSviral
                  Member
                  • Dec 2014
                  • 38

                  #9
                  GenoMax,

                  Yep, I am using the latest BLAST+ package - 2.2.31 along with the most recent nt database.

                  Could it be possible an OS update could fix the problem?

                  And yes ha, we are using a 10 year old OS. As I mentioned my institute can be ridiculously picky when installing new software due to security measures. Even so a 10 year old OS is a bit ridiculous really.

                  Comment

                  • GenoMax
                    Senior Member
                    • Feb 2008
                    • 7142

                    #10
                    You need a complete reinstall of a newer vintage OS

                    On a serious note, if you are not able to update the OS you could try compiling blast from source code (I am not even sure if that will work). Blast may expect latest libraries and such that are likely not going to be available in a 10 yr old OS. Even the compiler you have available will likely not work.

                    Comment

                    • GSviral
                      Member
                      • Dec 2014
                      • 38

                      #11
                      Thanks for the help GenoMax. I may just have to find a different computer to do this all on unfortunately.

                      Just as a revision, in case I have installed something incorrectly.

                      I downloaded the latest NCBI BLAST+ package which I then extracted.

                      I also downloaded the most recent nucleotide database which I extracted in to the .bin folder of the extracted BLAST+ package. Does this all sound correct?

                      I do not have much experience when running command line so perhaps I have installed something incorrectly.

                      Comment

                      • GenoMax
                        Senior Member
                        • Feb 2008
                        • 7142

                        #12
                        For the purpose of what you were trying to run that all sounds right. I am surprised that even blastn worked considering makeblastdb generates a seg fault.

                        Comment

                        • GSviral
                          Member
                          • Dec 2014
                          • 38

                          #13
                          Thanks GenoMax.

                          Seems I will have to find an alternative route.

                          Sometimes if I am BLASTing a particularly large FASTA file I will get a segmentation fault a short time after the command has been input and it stops the process then and there resulting in incomplete output.

                          Comment

                          Latest Articles

                          Collapse

                          • GATTACAT
                            Reply to Nine Things a Sample Prep Scientist Thinks About Before Sequencing
                            by GATTACAT
                            Love this - good data definitely starts from good input, and poor input can only give relatively poor data. I particularly like the mention of Nanodrop/absorbance based methods for quantification. It's such a toss up if you'll get an accurate reading or what amounts to a randomly generated number, and a lot of library/sequencing related issues can be traced back to poor quant.
                            07-01-2026, 11:43 AM
                          • SEQadmin2
                            Nine Things a Sample Prep Scientist Thinks About Before Sequencing
                            by SEQadmin2


                            I’m not a sequencing expert. I’m a purification scientist who uses NGS to evaluate workflows my group develops. With this perspective, we think about the sample first and the NGS workflow second. The sequencer is an exceptionally honest reporter, but it can only report on what you give it, so whether you get clean, interpretable data from an NGS workflow is largely determined before you begin.

                            Here are nine questions we think about, in roughly the order they matter, before...
                            06-18-2026, 07:11 AM

                          ad_right_rmr

                          Collapse

                          News

                          Collapse

                          Topics Statistics Last Post
                          Started by SEQadmin2, Yesterday, 11:08 AM
                          0 responses
                          7 views
                          0 reactions
                          Last Post SEQadmin2  
                          Started by SEQadmin2, 06-30-2026, 05:37 AM
                          0 responses
                          11 views
                          0 reactions
                          Last Post SEQadmin2  
                          Started by SEQadmin2, 06-26-2026, 11:10 AM
                          0 responses
                          19 views
                          0 reactions
                          Last Post SEQadmin2  
                          Started by SEQadmin2, 06-17-2026, 06:09 AM
                          0 responses
                          53 views
                          0 reactions
                          Last Post SEQadmin2  
                          Working...