Seqanswers Leaderboard Ad

**maubp** · 10-18-2011, 09:20 AM

What version numbers? That can make a difference.

I don't use -num_descriptions and -num_alignments having found them behaving oddly (something at least partially addressed in a recent BLAST+ release). Have you tried with -max_target_seqs instead?

**rskr** · 10-18-2011, 09:30 AM

I don't think you can look at the evalues and say they are lower therefore better, or returning more or fewer hits. The statistics aren't comparable without calibrating the Karlin-Altschul parameters. I am suspicious of blast+, because it is so fast I suspect that they tweaked the hash word size parameters in favor of speed rather than accuracy. You might want to compare the the actual parameters that are used for example, look at what parameters blastall runs blastn with then compare them with blast+, which is the equivalent of blastn. There is a way to get them to print the actual parameters, not just the parameters of the wrapper. My understanding is that there isn't much difference in the two but mostly if there was a difference it was the parametrization that the wrappers used.

**Symphysodon** · 10-19-2011, 03:30 AM

Hi,

Thanks maubp and rskr for your feedback.

I am using blastn from blastall 2.2.23 and blastn from BLAST+ 2.2.25.

Perhaps if I specify for both to use the same hash word size, that might be a more equivalent comparison. Note that I have specified for both to have the dust filters turned OFF.

I'll try -max_target_seqs in BLAST+. Do you know what the equivalent parameter in BLASTALL is?

I specified -num_descriptions and -num_alignments for BLAST+ blastn as the legacy_blast.pl returned them as the equivalent of -b and -v in BLASTALL blastn.

If anyone can let me know how to get both applications to print out all the actual default parameters they used, that'd be great.

Cheers!

**mdimon** · 10-25-2011, 02:52 PM

The difference in the number of hits between the default and csv formats is that the -b and -v parameters are only followed for the default format. In the csv format, the -b and -v parameters are ignored.

In BLAST+ this was remedied by the introduction of the -max_target_seqs parameter. The documentation suggests that for the default format, the -num_descriptions and -num_alignments options should be used but for XML and tabular output, the -max_target_seqs options should be used instead.

As far as seeing different results between old and new BLASTs, have you figured out what type of sequences lead to different results?

Topics	Statistics	Last Post
Expanding the Horizons of Cellular Research with the Single Cell Atlas by seqadmin Started by seqadmin, Yesterday, 11:49 AM	0 responses 15 views 0 likes	Last Post by seqadmin Yesterday, 11:49 AM
Genetic Variants and Diabetes Risk in Childhood Cancer Survivors by seqadmin Started by seqadmin, 04-24-2024, 08:47 AM	0 responses 16 views 0 likes	Last Post by seqadmin 04-24-2024, 08:47 AM
Cancer Metastasis: A Deep Dive into Cellular Plasticity by seqadmin Started by seqadmin, 04-11-2024, 12:08 PM	0 responses 61 views 0 likes	Last Post by seqadmin 04-11-2024, 12:08 PM
Proteogenomic Profiles Offer New Clues in Prostate Cancer by seqadmin Started by seqadmin, 04-10-2024, 10:19 PM	0 responses 60 views 0 likes	Last Post by seqadmin 04-10-2024, 10:19 PM

Seqanswers Leaderboard Ad

Announcement

BLAST+ vs BLASTALL (legacy BLAST)

Comment

Comment

Comment

Comment

Latest Articles

ad_right_rmr

News