Hello.
I am using local BLASTP program to blast my protein sequences (around 600 sequences) against other organism's whole proteome database. What I want to do is to filter the output by some parameters. I have applied a reciprocal best_hit strategy, and the parameters were set as 'max identity > 30%', e<1e-10 and best_hit_overhang 0.25). However, in some cases, for example, the max identity between one best-hit set of query and subject sequence was as high as 40%, the query coverage was only 2%. Therefor, I want to exclude the best-hit sequences like that by setting a parameter like 'query coverage > 30%'.
The web based blast on NCBI could show the query coverage, but I do not know what is the command line for local BLASTP to output "query coverage". Please give me some ideas to resolve this problem. Thanks!
I am using local BLASTP program to blast my protein sequences (around 600 sequences) against other organism's whole proteome database. What I want to do is to filter the output by some parameters. I have applied a reciprocal best_hit strategy, and the parameters were set as 'max identity > 30%', e<1e-10 and best_hit_overhang 0.25). However, in some cases, for example, the max identity between one best-hit set of query and subject sequence was as high as 40%, the query coverage was only 2%. Therefor, I want to exclude the best-hit sequences like that by setting a parameter like 'query coverage > 30%'.
The web based blast on NCBI could show the query coverage, but I do not know what is the command line for local BLASTP to output "query coverage". Please give me some ideas to resolve this problem. Thanks!
Comment