
  • tblastx fmt1 Output Interpretation

    Hello,
    While locally running "blastx" with output option "-outfmt 1", I have a question.

    The below is an example of my result. As seen, throughout the outputs, all pairs of anchored proteins are not matched. How can I interpret these non-identical alignment?

    Query_3         477  GSVEYVHMLNGTMCATTRTVCAL  409
    medicagoRALF3    38  .GM.WI.QTKTAT.EGSIAD.M.   60

    Query_4          34  PYHLLLQKFYKT   69
    medicagoRALF11  104  ..NRGCS.Y.RC  115
    Thank you in advance.
    Last edited by syintel87; 03-06-2015, 08:56 AM.

  • #2
    Are you searching with very short query sequences (like illumina reads)?



    • #3
      My query sequences are de novo assembled contigs, some of which are short (e.g. 500) whereas others are very long (e.g. 100,000).

      The BLAST database, however, consists of several peptides that are short (e.g. length 44).



      • #4
        Have you tried to do the search the other way around (using your peptides as query)?

        Try using BLAT too. Especially if you know that you expect the peptides to be there in your data.
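One way to read "the other way around" is a tblastn search: the peptides become the query and the contigs become a nucleotide database that BLAST translates in all six frames. The sketch below only assembles the two BLAST+ command lines and prints them. The file and database names are hypothetical placeholders, not taken from this thread, and it assumes the BLAST+ programs `makeblastdb` and `tblastn` are installed.

```python
import shlex
from typing import List

# Hypothetical file and database names (not from the thread); substitute your own.
CONTIGS_FASTA = "contigs.fasta"
PEPTIDES_FASTA = "peptides.fasta"
CONTIG_DB = "contigs_db"


def makeblastdb_command(fasta: str, db_name: str) -> List[str]:
    """Command that formats the contigs as a nucleotide BLAST database."""
    return ["makeblastdb", "-in", fasta, "-dbtype", "nucl", "-out", db_name]


def tblastn_command(query: str, db_name: str, outfmt: int = 2) -> List[str]:
    """Command that searches protein queries against the translated nucleotide database."""
    return ["tblastn", "-query", query, "-db", db_name, "-outfmt", str(outfmt)]


# Print the commands instead of running them, so the sketch works without BLAST+.
for cmd in (
    makeblastdb_command(CONTIGS_FASTA, CONTIG_DB),
    tblastn_command(PEPTIDES_FASTA, CONTIG_DB),
):
    print(shlex.join(cmd))
```

Each list can be passed to `subprocess.run(cmd, check=True)` once the real file names are filled in.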



        • #5
          Besides "-outfmt 1", I tried the other alignment view options as well.
          They seem to display the alignment in different ways:

          0 = pairwise
          1 = query-anchored showing identities
          2 = query-anchored no identities
          3 = flat query-anchored, show identities
          4 = flat query-anchored, no identities

          [-outfmt 0]
          Query_2        229  KMSFRYLFFAIKKYALSKF  173
          thalianaRALF4    5  ...LTS...VSIVIV..L.   23

          [-outfmt 1]
          Query_2        229  KMSFRYLFFAIKKYALSKF  173
          thalianaRALF4    5  ...LTS...VSIVIV..L.   23

          [-outfmt 2]
          Query_2        229  KMSFRYLFFAIKKYALSKF  173
          thalianaRALF4    5  KMSLTSLFFVSIVIVLSLF   23

          [-outfmt 3]
          Query_2        229  KMSFRYLFFAIKKYALSKF  173
          thalianaRALF4    5  ...LTS...VSIVIV..L.   23

          [-outfmt 4]
          Query_2        229  KMSFRYLFFAIKKYALSKF  173
          thalianaRALF4    5  KMSLTSLFFVSIVIVLSLF   23
          The results of [-outfmt 2] and [-outfmt 4] may be what I was hoping to get. However, I still do not understand the principles behind output formats 0 to 4 or how they differ.



          • #6
            Ah, now I see! In formats 0, 1, and 3, dots stand for identities between query and target, and differences are shown with amino acid letters.

            I should have posted after more consideration.
            Thanks GenoMax! I am going to try "BLAT", too.
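The dot convention can be checked with a few lines of Python. This is a minimal sketch, not part of BLAST: the helper names are made up for illustration. It assumes the two aligned rows have equal length and that every "." in the subject row means "same residue as the query in this column". Applied to the Query_2 / thalianaRALF4 rows from post #5, it turns the dotted row into the full-letter row.

```python
def expand_identities(query_row: str, subject_row: str) -> str:
    """Replace each '.' in the subject row with the query residue in that column."""
    if len(query_row) != len(subject_row):
        raise ValueError("aligned rows must have the same length")
    return "".join(q if s == "." else s for q, s in zip(query_row, subject_row))


def count_identities(subject_row: str) -> int:
    """Number of columns marked as identical to the query."""
    return subject_row.count(".")


# Rows copied from the Query_2 / thalianaRALF4 example in post #5.
query = "KMSFRYLFFAIKKYALSKF"
dotted = "...LTS...VSIVIV..L."

print(expand_identities(query, dotted))                 # KMSLTSLFFVSIVIVLSLF
print(count_identities(dotted), "of", len(dotted))      # 9 of 19
```

The expanded row is the same string shown under [-outfmt 2] and [-outfmt 4] in post #5, so the dotted and full-letter views describe the same alignment.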
