Seqanswers Leaderboard Ad

**GenoMax** · 03-05-2015, 12:58 PM

Are you searching with very short query sequences (like illumina reads)?

**syintel87** · 03-06-2015, 08:26 AM

My query sequences are contigs that are de novo assembled, some of which are short (e.g. 500) whereas others are very long (e.g. 100,000).

But database of blast is composed of several peptide proteins whose length is short (e.g. 44).

**GenoMax** · 03-06-2015, 08:36 AM

Have you tried to do the search the other way around (using your peptides as query)?

Try using BLAT too. Especially if you know that you expect the peptides to be there in your data.

**syintel87** · 03-06-2015, 09:21 AM

In addition to "-outfmt 1", I tried other options as well.
There seem to be different ways of alignment.

0 = pairwise
1 = query-anchored showing identities
2 = query-anchored no identities
3 = flat query-anchored, show identities
4 = flat query-anchored, no identities

[-outfmt 0]
Query_2 229 KMSFRYLFFAIKKYALSKF 173
thalianaRALF4 5 ...LTS...VSIVIV..L. 23

[-outfmt 1]
Query_2 229 KMSFRYLFFAIKKYALSKF 173
thalianaRALF4 5 ...LTS...VSIVIV..L. 23

[-outfmt 2]
Query_2 229 KMSFRYLFFAIKKYALSKF 173
thalianaRALF4 5 KMSLTSLFFVSIVIVLSLF 23

[-outfmt 3]
Query_2 229 KMSFRYLFFAIKKYALSKF 173
thalianaRALF4 5 ...LTS...VSIVIV..L. 23

[-outfmt 4]
Query_2 229 KMSFRYLFFAIKKYALSKF 173
thalianaRALF4 5 KMSLTSLFFVSIVIVLSLF 23

The results of [-outfmt 2] and [-outfmt 4] may be the results that I look forward to getting. However, I still cannot understand the principles and differences that distinguish output format 0 to 4.

**syintel87** · 03-06-2015, 09:33 AM

Ah, now I see! In formats "0, 1, and 3", dots stand for identities between query and target. And differences are shown with protein letters.

I should have posted after more consideration.
Thanks GenoMax! I am going to try "BLAT", too.

Topics	Statistics	Last Post
Gene Misexpression in the Healthy Human Population by seqadmin Started by seqadmin, 07-25-2024, 06:46 AM	0 responses 9 views 0 likes	Last Post by seqadmin 07-25-2024, 06:46 AM
New Method for Rapid Genetic Diagnosis of Mendelian Disorders by seqadmin Started by seqadmin, 07-24-2024, 11:09 AM	0 responses 26 views 0 likes	Last Post by seqadmin 07-24-2024, 11:09 AM
Advancing Nanopore Technology for Portable Sensing Devices by seqadmin Started by seqadmin, 07-19-2024, 07:20 AM	0 responses 160 views 0 likes	Last Post by seqadmin 07-19-2024, 07:20 AM
New RNA-Based Gene Writing Technology Achieves Precise Gene Integration by seqadmin Started by seqadmin, 07-16-2024, 05:49 AM	0 responses 127 views 0 likes	Last Post by seqadmin 07-16-2024, 05:49 AM

Seqanswers Leaderboard Ad

Announcement

tblastx fmt1 Output Interpretation

Comment

Comment

Comment

Comment

Comment

Latest Articles

ad_right_rmr

News