Seqanswers Leaderboard Ad

Collapse

Announcement

Collapse
No announcement yet.
X
 
  • Filter
  • Time
  • Show
Clear All
new posts

  • westerman
    replied
    Ditto ... yeah!

    Leave a comment:


  • OTU
    replied
    Yeah! It works!
    Thank you so much!
    It took really a long time, to compute this!

    Leave a comment:


  • westerman
    replied
    OK, the pal file is there and looking good. Let's see what happens once you have the *.00.* files back in place. The lack of them *may* cause problems although I would have expected a different error message.

    Leave a comment:


  • OTU
    replied
    $ ls -l /home/bwawrik/db/refseq_protein*pal
    -rw-rw-r-- 1 bwawrik bwawrik 275 Dec 15 19:12 /home/bwawrik/db/refseq_protein.pal

    Leave a comment:


  • westerman
    replied
    Looks like your directory has extraneous files in it. They probably do not hurt. How about doing a

    ls -l /home/bwawrik/db/refseq_protein*pal

    Let's see if you have the overall index file.

    Leave a comment:


  • OTU
    replied
    I know the problem , why there is not .00 file - I accidently deleted it.
    I am downloading it now.
    And for the checking of '*.pal', we have problems:
    $ ls -l /home/bwawrik/db/refseq_protein* | tail --lines=2
    -rw-r--r-- 1 bwawrik bwawrik 59 Jan 21 11:39 /home/bwawrik/db/refseq_protein.2.08.tar.gz.md5
    -rw-r--r-- 1 bwawrik bwawrik 59 Jan 21 11:39 /home/bwawrik/db/refseq_protein.2.09.tar.gz.md5

    Leave a comment:


  • westerman
    replied
    OK. Now we are getting somewhere -- at least I can be sure that the paths look correct. What I find strange is that your database files begin with *.01.* -- mine begin with *.00.*; e.g.,

    /group/diagrid/databases/ncbi/week-04-2014/refseq_protein.00.phr
    /group/diagrid/databases/ncbi/week-04-2014/refseq_protein.00.pin
    More importantly we need to make sure that the overall index file is in place. Mine is at the bottom of the listing so that if I do a 'tail --lines=2' instead of using 'head' I get:

    /group/diagrid/databases/ncbi/week-04-2014/refseq_protein.09.psq
    /group/diagrid/databases/ncbi/week-04-2014/refseq_protein.pal
    Or using 'ls -l'

    -rw-r--r-- 1 braub diagrid-apps 275 Dec 15 20:12 /group/diagrid/databases/ncbi/week-04-2014/refseq_protein.pal
    What do you get? I am trying to see if the '*.pal' file is present.

    Leave a comment:


  • OTU
    replied
    Ok, so far:
    $ ls -l /home/bwawrik/db/refseq_protein* | head --lines=2
    -rw-rw-r-- 1 bwawrik bwawrik 534122462 Dec 15 18:37 /home/bwawrik/db/refseq_protein.01.phr
    -rw-rw-r-- 1 bwawrik bwawrik 23105152 Dec 15 18:37 /home/bwawrik/db/refseq_protein.01.pin

    and when using blastp without star:
    BLAST Database error: No alias or index file found for protein database [/home/bwawrik/db/refseq_protein] in search path [/home/bwawrik/software/multi-metagenome/R.data.generation::]

    Leave a comment:


  • westerman
    replied
    Don't use a star (*) in your -db name. It should be:

    -db /home/bwawrik/db/refseq_protein

    Otherwise you are telling blastp that there are 81 (or so) files to use as the DB. It wants the overall name, not the overall files. Your initial blastp line did not have the star and thus it seemed correct except for the pathing problem. Your current blastp is obviously incorrect.

    As for your makeblastdb error ... you are doing it wrong. I was going to mention that but it is not relevant to why blastp is not working. Once again you are telling the program to use 81 files. The program is basically seeing:

    makeblastdb -in /home/bwawrik/db/refseq_protein.00.phr /home/bwawrik/db/refseq_protein.00.pin /home/bwawrik/db/refseq_protein.00.pnd ... etc.

    Which of course ruins the one (1) parameter that should be after '-in' and brings up the 'too many positional arguments' error.

    But as I said that is neither here nor there for running blastp. Let's not be concerned with makeblastdb.

    Going on ... are you sure you ran that 'ls' that I gave you? I specified 'refseq_protein*' not the 'refseq_protein.*' (with a dot) that ls complained about.

    Try the blastp without a star in the -db. And post the results of:

    ls /home/bwawrik/db/refseq_protein* | head --lines=2

    Leave a comment:


  • mastal
    replied
    Try just using

    -db /home/bwawrik/db/refseq_protein

    the full path, but only the prefix of the database name

    Leave a comment:


  • OTU
    replied
    No problem.
    So, from previous post, here is the full paths included:
    $ blastp -query /home/bwawrik/software/multi-metagenome/R.data.generation/IIa.orfs.hmm.faa.db -db /home/bwawrik/db/refseq_protein.* -evalue 1e-5 -num_threads 60 -max_target_seqs 5 -outfmt 5 -out IIa.orfs.hmm.blast.xml

    And when I used the command:
    $ls -l ~/refseq_protein* | head --lines=2

    I got an error:
    >>ls: cannot access home/bwawrik/db/refseq_protein.*: No such file or directory

    I just cannot understand: if it doesn't "see" the files of database, how it gave an error during running of makeblastdb:
    $makeblastdb -in /home/bwawrik/db/refseq_protein.* -dbtype prot -out /home/bwawrik/db/refseq_protein.db
    >>Error: Too many positional arguments (1), the offending value: /home/bwawrik/db/refseq_protein.01.phr

    Because from this, it seems that it CAN actually read the file, but is simply not "happy" with it.

    Leave a comment:


  • westerman
    replied
    Well, once again not seeing an 'ls' of your directory and not seeing the actual program line you are using (since you edited it), it becomes hard to troubleshoot the problem. Almost all of the time when someone posts about a file not being found it is because they are not using the correct path for the file despite what they think. In other words the file simply isn't there. I've done it about a zillion times myself.

    Going by your statement "there 81 files in the folder db in home directory" then your original blastp line is incorrect since you are *not* using the folder 'db in home directory'. Instead you are just using your home directory.

    Please check your paths. If nothing else do an:

    ls -l ~/refseq_protein* | head --lines=2

    And post that.

    Leave a comment:


  • OTU
    replied
    This is what I have done - there were everywhere specified full paths (I just eliminated them from the question). And yes, there are 81 files in the folder db in home directory...

    Leave a comment:


  • westerman
    replied
    1) I suggest using full path names instead of '~'.

    2) To help troubleshoot cases of 'no file found' it is handy for us to see an 'ls' of the directory in question just to make sure you haven't done a mistake such as specifying the wrong directory.

    3) Pre-formatted refseq_protein should be 81 files -- no folders involved.

    Leave a comment:


  • OTU
    replied
    Hi everyone!
    My problem is somehow realted to this described issue. I am using a refseq)protein database, downloaded from ncbi ftp, which consists of in total 9 folders with files .pni .pnd .pog and so on. When I am using the command
    $blastp -query ~/IIa.orfs.hmm.faa.db -db ~/refseq_protein -evalue 1e-5 -num_threads 60 -max_target_seqs 5 -outfmt 5 -out IIa.orfs.hmm.blast.xml

    I am getting an error:
    >>BLAST Database error: No alias or index file found for protein database [.../db/refseq_protein] in search path [.../software/multi-metagenome/R.data.generation::]

    Any ideas what is happening?
    I even tried to use makeblastdb command, to format my databases, but it doesn't work as well.
    $ makeblastdb -in ~/refseq_protein.*.* -dbtype prot -out ~/db/refseq_protein.db
    >>Error: Too many positional arguments (1), the offending value: ~/db/refseq_protein.01.phr

    Need help!!!!

    otu

    Leave a comment:

Latest Articles

Collapse

  • seqadmin
    Best Practices for Single-Cell Sequencing Analysis
    by seqadmin



    While isolating and preparing single cells for sequencing was historically the bottleneck, recent technological advancements have shifted the challenge to data analysis. This highlights the rapidly evolving nature of single-cell sequencing. The inherent complexity of single-cell analysis has intensified with the surge in data volume and the incorporation of diverse and more complex datasets. This article explores the challenges in analysis, examines common pitfalls, offers...
    06-06-2024, 07:15 AM
  • seqadmin
    Latest Developments in Precision Medicine
    by seqadmin



    Technological advances have led to drastic improvements in the field of precision medicine, enabling more personalized approaches to treatment. This article explores four leading groups that are overcoming many of the challenges of genomic profiling and precision medicine through their innovative platforms and technologies.

    Somatic Genomics
    “We have such a tremendous amount of genetic diversity that exists within each of us, and not just between us as individuals,”...
    05-24-2024, 01:16 PM

ad_right_rmr

Collapse

News

Collapse

Topics Statistics Last Post
Started by seqadmin, Today, 08:58 AM
0 responses
8 views
0 likes
Last Post seqadmin  
Started by seqadmin, Yesterday, 02:20 PM
0 responses
14 views
0 likes
Last Post seqadmin  
Started by seqadmin, 06-07-2024, 06:58 AM
0 responses
181 views
0 likes
Last Post seqadmin  
Started by seqadmin, 06-06-2024, 08:18 AM
0 responses
231 views
0 likes
Last Post seqadmin  
Working...
X