Seqanswers Leaderboard Ad

Collapse
X
 
  • Filter
  • Time
  • Show
Clear All
new posts
  • salamay
    Member
    • May 2014
    • 20

    BLASTX Issues

    I am attempting to run blastx from the command line, and it seems to be very slow (~5-6 minutes for 10 100bp sequences). I compared this to blastx on the NCBI website, which took only 30 seconds. This indicates that there is an issue with the command line usage of blastx, but I don't know where to start examining this issue.

    I made sure that the db used in the command line and on the ncbi website was the same (nr). I am using all default parameters.

    Any help would be appreciated.

    Thanks
  • GenoMax
    Senior Member
    • Feb 2008
    • 7142

    #2
    That is an apples to oranges comparison.

    Hardware NCBI uses for Blast searches is going to be tremendously more powerful than anything you are using locally.

    What kind of hardware are you running your searches on? How much RAM are you allocating to your searches locally?

    Comment

    • salamay
      Member
      • May 2014
      • 20

      #3
      The machine I run the searches on is a 48 core 2.7ghz with 264 Gb RAM, but not sure how much of that is allocated for the search (nor do I know how to specify allocation). I run the search with default parameters so num_threads is set to 1 by default. Even though, I don't think it should be that slow. Or am I wrong?

      Comment

      • GenoMax
        Senior Member
        • Feb 2008
        • 7142

        #4
        If you have 48 cores then you would certainly want to up that num_threads option. You may need to experiment with that number to find the sweet spot for your hardware configuration. Start at the half-way point (and go up and down).

        If you have admin access on the machine you may want to make a RAMDISK and copy the database files there (http://askubuntu.com/questions/15286...ake-a-ram-disk) If you are not using linux then look for an appropriate set of instructions for your OS.

        Comment

        • salamay
          Member
          • May 2014
          • 20

          #5
          Increasing the threads definitely helps. I guess I'll fiddle around with that. Thanks GenoMax!

          Comment

          • debem
            Junior Member
            • Dec 2014
            • 1

            #6
            Hi ..
            I am trying to get only the "no hits" result after perform the Blastx on the command line

            I've tried:

            <my Blastx command> | tee redirect.result33.txt | cut -d ' ' -f 1 | uniq | comm -31 - ids33_query.txt

            Any help would be appreciated.

            Thanks
            Last edited by debem; 12-09-2014, 05:42 PM.

            Comment

            • Rhewter
              Member
              • Sep 2014
              • 10

              #7
              Hi debem,

              Input your results in to two files.csv and try:
              sdiff 1.csv 2.csv > difer.csv

              Open the file difer.csv and select the lines with "<".
              This is your differents IDs.

              Big hub,
              Rhewter.

              Comment

              Latest Articles

              Collapse

              • seqadmin
                Pathogen Surveillance with Advanced Genomic Tools
                by seqadmin




                The COVID-19 pandemic highlighted the need for proactive pathogen surveillance systems. As ongoing threats like avian influenza and newly emerging infections continue to pose risks, researchers are working to improve how quickly and accurately pathogens can be identified and tracked. In a recent SEQanswers webinar, two experts discussed how next-generation sequencing (NGS) and machine learning are shaping efforts to monitor viral variation and trace the origins of infectious...
                03-24-2025, 11:48 AM
              • seqadmin
                New Genomics Tools and Methods Shared at AGBT 2025
                by seqadmin


                This year’s Advances in Genome Biology and Technology (AGBT) General Meeting commemorated the 25th anniversary of the event at its original venue on Marco Island, Florida. While this year’s event didn’t include high-profile musical performances, the industry announcements and cutting-edge research still drew the attention of leading scientists.

                The Headliner
                The biggest announcement was Roche stepping back into the sequencing platform market. In the years since...
                03-03-2025, 01:39 PM

              ad_right_rmr

              Collapse

              News

              Collapse

              Topics Statistics Last Post
              Started by seqadmin, 03-20-2025, 05:03 AM
              0 responses
              42 views
              0 reactions
              Last Post seqadmin  
              Started by seqadmin, 03-19-2025, 07:27 AM
              0 responses
              53 views
              0 reactions
              Last Post seqadmin  
              Started by seqadmin, 03-18-2025, 12:50 PM
              0 responses
              39 views
              0 reactions
              Last Post seqadmin  
              Started by seqadmin, 03-03-2025, 01:15 PM
              0 responses
              194 views
              0 reactions
              Last Post seqadmin  
              Working...