Seqanswers Leaderboard Ad

Collapse

Announcement

Collapse
No announcement yet.
X
 
  • Filter
  • Time
  • Show
Clear All
new posts

  • tblastx fmt1 Output Interpretation

    Hello,
    While locally running "blastx" with output option "-outfmt 1", I have a question.

    The below is an example of my result. As seen, throughout the outputs, all pairs of anchored proteins are not matched. How can I interpret these non-identical alignment?

    Query_3 477 GSVEYVHMLNGTMCATTRTVCAL 409
    medicagoRALF3 38 .GM.WI.QTKTAT.EGSIAD.M. 60

    Query_4 34 PYHLLLQKFYKT 69
    medicagoRALF11 104 ..NRGCS.Y.RC 115
    Thank you in advance.
    Attached Files
    Last edited by syintel87; 03-06-2015, 08:56 AM.

  • #2
    Are you searching with very short query sequences (like illumina reads)?

    Comment


    • #3
      My query sequences are contigs that are de novo assembled, some of which are short (e.g. 500) whereas others are very long (e.g. 100,000).

      But database of blast is composed of several peptide proteins whose length is short (e.g. 44).

      Comment


      • #4
        Have you tried to do the search the other way around (using your peptides as query)?

        Try using BLAT too. Especially if you know that you expect the peptides to be there in your data.

        Comment


        • #5
          In addition to "-outfmt 1", I tried other options as well.
          There seem to be different ways of alignment.

          0 = pairwise
          1 = query-anchored showing identities
          2 = query-anchored no identities
          3 = flat query-anchored, show identities
          4 = flat query-anchored, no identities

          [-outfmt 0]
          Query_2 229 KMSFRYLFFAIKKYALSKF 173
          thalianaRALF4 5 ...LTS...VSIVIV..L. 23

          [-outfmt 1]
          Query_2 229 KMSFRYLFFAIKKYALSKF 173
          thalianaRALF4 5 ...LTS...VSIVIV..L. 23

          [-outfmt 2]
          Query_2 229 KMSFRYLFFAIKKYALSKF 173
          thalianaRALF4 5 KMSLTSLFFVSIVIVLSLF 23

          [-outfmt 3]
          Query_2 229 KMSFRYLFFAIKKYALSKF 173
          thalianaRALF4 5 ...LTS...VSIVIV..L. 23

          [-outfmt 4]
          Query_2 229 KMSFRYLFFAIKKYALSKF 173
          thalianaRALF4 5 KMSLTSLFFVSIVIVLSLF 23
          The results of [-outfmt 2] and [-outfmt 4] may be the results that I look forward to getting. However, I still cannot understand the principles and differences that distinguish output format 0 to 4.

          Comment


          • #6
            Ah, now I see! In formats "0, 1, and 3", dots stand for identities between query and target. And differences are shown with protein letters.

            I should have posted after more consideration.
            Thanks GenoMax! I am going to try "BLAT", too.

            Comment

            Latest Articles

            Collapse

            • seqadmin
              Essential Discoveries and Tools in Epitranscriptomics
              by seqadmin


              The field of epigenetics has traditionally concentrated more on DNA and how changes like methylation and phosphorylation of histones impact gene expression and regulation. However, our increased understanding of RNA modifications and their importance in cellular processes has led to a rise in epitranscriptomics research. “Epitranscriptomics brings together the concepts of epigenetics and gene expression,” explained Adrien Leger, PhD, Principal Research Scientist on Modified Bases...
              Yesterday, 07:01 AM
            • seqadmin
              Current Approaches to Protein Sequencing
              by seqadmin


              Proteins are often described as the workhorses of the cell, and identifying their sequences is key to understanding their role in biological processes and disease. Currently, the most common technique used to determine protein sequences is mass spectrometry. While still a valuable tool, mass spectrometry faces several limitations and requires a highly experienced scientist familiar with the equipment to operate it. Additionally, other proteomic methods, like affinity assays, are constrained...
              04-04-2024, 04:25 PM

            ad_right_rmr

            Collapse

            News

            Collapse

            Topics Statistics Last Post
            Started by seqadmin, 04-11-2024, 12:08 PM
            0 responses
            40 views
            0 likes
            Last Post seqadmin  
            Started by seqadmin, 04-10-2024, 10:19 PM
            0 responses
            41 views
            0 likes
            Last Post seqadmin  
            Started by seqadmin, 04-10-2024, 09:21 AM
            0 responses
            36 views
            0 likes
            Last Post seqadmin  
            Started by seqadmin, 04-04-2024, 09:00 AM
            0 responses
            55 views
            0 likes
            Last Post seqadmin  
            Working...
            X