  • arredondoea
    Junior Member
    • Mar 2024
    • 4

    Problem with blastdbcmd: “Error: [blastdbcmd] Skipped TRANSCRIPT_4865”

    Hello, I am looking for help with running a Reciprocal BLAST. I cannot query any of my transcripts using blastdbcmd because I get this error message: “Error: [blastdbcmd] Skipped TRANSCRIPT_4865”. I appreciate any insight that can help resolve this issue!

    I am using high-quality FASTA files of compiled transcripts for the subspecies I am working with. For context, I ran LORDEC on my transcript files to reduce redundancy. I am now running a reciprocal BLAST to identify comparable contigs between subspecies.


    This is the script I am using for the Reciprocal BLAST (macOS):

    #Making a database

    makeblastdb -in ~/Desktop/tegula_blast/Hq_transcripts_2.fa -dbtype 'nucl' -out ~/Desktop/tegula_blast/Tfunebralis_DB -parse_seqids


    #Output message

    Building a new DB, current time: 03/06/2024 11:23:46

    New DB name: /Users/lanigleason/Desktop/tegula_blast/Tfunebralis_DB

    New DB title: /Users/lanigleason/Desktop/tegula_blast/Hq_transcripts_2.fa

    Sequence type: Nucleotide

    Keep MBits: T

    Maximum file size: 3000000000B

    Adding sequences from FASTA; added 99388 sequences in 2.75653 seconds.



    #Blast for all possible pairwise comparisons

    blastn -query ~/Desktop/tegula_blast/Hq_transcripts_1.fa -db ~/Desktop/tegula_blast/Tfunebralis_DB -out ~/Desktop/tegula_blast/eiseni_to_funebralis.txt -evalue 1E-20 -outfmt 6 -max_target_seqs 1


    #Output message

    Warning: [blastn] Examining 5 or more matches is recommended

    (See attached photo for output file)
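
The warning above recommends examining more than one match per query. Below is a minimal sketch of keeping only the highest-scoring line per query, assuming the search were rerun with a larger -max_target_seqs value and the default -outfmt 6 column order (query ID in column 1, bit score in column 12). The function name best_hit_per_query is hypothetical and not part of BLAST:

```shell
# Hypothetical helper: keep one line per query from a BLAST tabular
# (-outfmt 6) file, choosing the line with the highest bit score.
# Assumes the default outfmt 6 columns: qseqid is column 1, bitscore is column 12.
best_hit_per_query() {
    # Sort by query ID, then by bit score (numeric, descending),
    # and let awk print only the first line seen for each query ID.
    sort -t "$(printf '\t')" -k1,1 -k12,12nr "$1" | awk -F '\t' '!seen[$1]++'
}

# Example call (input path from the post; output name is illustrative):
# best_hit_per_query ~/Desktop/tegula_blast/eiseni_to_funebralis.txt > best_hits.txt
```

Sorting first keeps the selection independent of the order in which blastn happened to write the lines.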


    #Retrieve subset of assembly using blastdbcmd

    Here, I took the query transcripts that had matches in the previous blastn output (eiseni_to_funebralis.txt) and made a list of their names (eiseni_names.txt).

    I noticed that the blastn output file lists some transcripts more than once. My original FASTA file does not contain duplicate transcripts, so the repeats must be a product of the blastn command. I wonder whether this could be a reason for the error in the next step.
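
With -outfmt 6, blastn writes one line per alignment (HSP), so the same query and subject pair can appear on several lines even with -max_target_seqs 1. Below is a minimal sketch of building a duplicate-free name list from the tabular output. The helper unique_ids is hypothetical. Column 1 holds query IDs (from Hq_transcripts_1.fa) and column 2 holds subject IDs (from the database built from Hq_transcripts_2.fa). blastdbcmd can only retrieve IDs that exist in the database given by -db, so the column the list is taken from may matter here:

```shell
# Hypothetical helper: print the unique IDs found in one column of a
# BLAST tabular (-outfmt 6) file.
#   $1 = path to the tabular file
#   $2 = column number (1 = query IDs, 2 = subject IDs in default outfmt 6)
unique_ids() {
    cut -f "$2" "$1" | sort -u
}

# Example calls (input path from the post; output names are illustrative):
# unique_ids ~/Desktop/tegula_blast/eiseni_to_funebralis.txt 1 > query_names.txt
# unique_ids ~/Desktop/tegula_blast/eiseni_to_funebralis.txt 2 > subject_names.txt
```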



    blastdbcmd -db ~/Desktop/tegula_blast/Tfunebralis_DB -dbtype 'nucl' -entry_batch ~/Desktop/tegula_blast/eiseni_names.txt -out ~/Desktop/tegula_blast/eiseni_to_funebralis.subset.reciprocal.fasta


    #Output message

    Error: [blastdbcmd] Skipped transcript_4866

    Error: [blastdbcmd] Skipped transcript_9439

    Error: [blastdbcmd] Skipped transcript_9439

    Error: [blastdbcmd] Skipped transcript_13586

    Error: [blastdbcmd] Skipped transcript_17909

    Error: [blastdbcmd] Skipped transcript_38053

    Error: [blastdbcmd] Skipped transcript_38053

    Error: [blastdbcmd] Skipped transcript_22088

    Error: [blastdbcmd] Skipped transcript_34418

    Error: [blastdbcmd] Skipped transcript_45393

    Error: [blastdbcmd] Skipped transcript_45393

    Error: [blastdbcmd] Skipped transcript_13587

    Error: [blastdbcmd] Skipped transcript_13587

    Error: [blastdbcmd] Skipped transcript_13587

    etc…


    Most of the transcripts are skipped in this step. As shown above, every duplicate entry of a skipped transcript is skipped as well.

    I have tried using a smaller subset of transcripts and removing the duplicate matches before running the reciprocal BLAST, but I encounter the same errors.


    I also tried listing the database identifiers in case I am missing something, but I am not sure how to use these identifiers in a blastdbcmd query.

    blastdbcmd -db ~/Desktop/tegula_blast/Tfunebralis_DB -dbtype 'nucl' -entry all -out ~/Desktop/tegula_blast/eiseni_to_funebralis.subset.reciprocal.all.fasta -outfmt "OID: %o GI: %g ACC: %a IDENTIFIER: %i"

    #Example output
    OID: 0 GI: N/A ACC: transcript_3756 IDENTIFIER: lcl|transcript_3756
    OID: 1 GI: N/A ACC: transcript_9791 IDENTIFIER: lcl|transcript_9791
    OID: 2 GI: N/A ACC: transcript_7816 IDENTIFIER: lcl|transcript_7816
    OID: 3 GI: N/A ACC: transcript_1853 IDENTIFIER: lcl|transcript_1853
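
One way to see which names blastdbcmd cannot resolve is to compare the name list against the accessions stored in the database. Below is a minimal sketch, assuming one ID per line in both files. The helper missing_ids and the file name db_ids.txt are hypothetical. The helper also strips carriage returns from the names file, on the assumption that hidden line-ending characters are one possible source of mismatches:

```shell
# Hypothetical helper: list IDs that are in a names file but absent from
# a file of database accessions (one ID per line in both files).
#   $1 = names file (e.g. the list passed to -entry_batch)
#   $2 = accessions file dumped from the database
missing_ids() {
    names_sorted=$(mktemp)
    db_sorted=$(mktemp)
    # Assumption: carriage returns, if present, are unwanted debris.
    tr -d '\r' < "$1" | sort -u > "$names_sorted"
    sort -u "$2" > "$db_sorted"
    # comm -23 prints lines that occur only in the first sorted file.
    comm -23 "$names_sorted" "$db_sorted"
    rm -f "$names_sorted" "$db_sorted"
}

# Example calls (database and names paths from the post):
# blastdbcmd -db ~/Desktop/tegula_blast/Tfunebralis_DB -entry all -outfmt "%a" > db_ids.txt
# missing_ids ~/Desktop/tegula_blast/eiseni_names.txt db_ids.txt
```

An empty result would mean every listed name exists in the database. Any names printed are ones the database does not contain under that exact spelling.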


    Please let me know of any ways to reduce the number of transcripts skipped by blastdbcmd. Any recommended parameter changes and other explanations are welcome. Thank you!

    Last edited by arredondoea; 03-06-2024, 12:43 PM.
