  • Extracting unique BLAST hits from a multi-hit region

    Hi all,

    I have a blast output file (from BLAST command line tool) and I want to extract from it the unique hits. I have several genomic regions of variable size and I explore them using BLAST against a home made database (there is redundancy in it). What I want to get out is one hit per fragment of the examined region. Any suggestions? I am using perl, so any answers using BioPerl would be highly appreciated.

    Thanks a lot in advance.
    Thanos

  • #2
    If you used tabular output, it would be rather trivial to parse it with cut and sort and then retrieve the sequences from your db with blastdbcmd.
    Last edited by rhinoceros; 07-02-2013, 09:23 AM.
    savetherhino.org
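
The parsing step suggested above could look roughly like this. It is a minimal sketch rather than the poster's exact commands: it assumes the 12 standard `-outfmt 6` columns (query ID in column 1, bit score in column 12) and reads "one hit per fragment" as "the highest-scoring hit per query". The sample rows, the file name `hits.tsv`, and the function name `best_hit_per_query` are made up for illustration.

```shell
# Hypothetical sample of BLAST -outfmt 6 output (tab-separated, 12 standard columns):
# qseqid sseqid pident length mismatch gapopen qstart qend sstart send evalue bitscore
demo_dir=$(mktemp -d)
hits="$demo_dir/hits.tsv"
printf '%s\t%s\t%s\t%s\t%s\t%s\t%s\t%s\t%s\t%s\t%s\t%s\n' \
    regionA subj1 98.5 200 3 0 1 200 50 249 1e-90 350 \
    regionA subj2 98.5 200 3 0 1 200 10 209 1e-90 350 \
    regionA subj3 90.0 150 15 0 20 169 5 154 1e-40 160 \
    regionB subj4 95.0 100 5 0 1 100 1 100 1e-30 120 \
    regionB subj5 99.0 100 1 0 1 100 1 100 1e-45 180 \
    > "$hits"

# Keep, for every query ID (column 1), the line with the highest bit score
# (column 12). On ties the first line seen wins, so redundant database
# entries collapse to a single hit.
best_hit_per_query() {
    awk -F '\t' '
        {
            bits = $12 + 0   # force a numeric comparison
            if (!($1 in best) || bits > score[$1]) {
                best[$1] = $0
                score[$1] = bits
            }
        }
        END { for (q in best) print best[q] }
    ' "$1" | sort
}

best_hit_per_query "$hits" > "$demo_dir/best_hits.tsv"
cut -f 1,2,12 "$demo_dir/best_hits.tsv"
```

With the sample rows this keeps subj1 for regionA and subj5 for regionB. If "fragment" means a distinct interval of the query rather than the whole query, the grouping key could be extended with the query start and end columns (7 and 8) instead of column 1 alone.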

    • #3
      Hi rhinoceros and thanks a lot for your immediate response. By tabular output do you mean option 6 in the -outfmt flag?

      I will try what you suggested; it does indeed seem fairly simple.

      Thanks once again,
      Thanos

      • #4
        Yeah, -outfmt 6

        Also, have a look here. It's probably rather relevant (second post)
        savetherhino.org

        • #5
          Thanks a lot once again for your help. I really appreciate that.

          • #6
            Do you happen to know how I can add the subject descriptions to the tabular output? I need them for the next steps in my pipeline. :-)

            Thanks in advance.

            • #7
              Have a look at the manual

              syntax would be e.g. -outfmt '6 std stitle'
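
With that format string, each output line carries the 12 standard columns followed by the subject title in column 13. A rough sketch of listing each distinct subject with its description, assuming the title column is actually populated for the database in question; the sample rows, the file name, and the function name `subject_titles` are hypothetical:

```shell
# Hypothetical sample of output produced with -outfmt '6 std stitle':
# the 12 standard columns plus the subject title as column 13.
demo_dir=$(mktemp -d)
hits13="$demo_dir/hits_with_titles.tsv"
printf '%s\t%s\t%s\t%s\t%s\t%s\t%s\t%s\t%s\t%s\t%s\t%s\t%s\n' \
    regionA subj1 98.5 200 3 0 1 200 50 249 1e-90 350 'transposase family A' \
    regionB subj1 97.0 180 5 0 1 180 60 239 1e-80 300 'transposase family A' \
    regionB subj5 99.0 100 1 0 1 100 1 100 1e-45 180 'hypothetical protein' \
    > "$hits13"

# List each subject ID (column 2) once, together with its title (column 13).
# Titles may contain spaces, so the tab delimiter (the default for cut) matters.
subject_titles() {
    cut -f 2,13 "$1" | sort -u
}

subject_titles "$hits13"
```

For the sample rows this prints one line for subj1 and one for subj5, each followed by its title.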

              However, since your db is not provided by ncbi, this will not work (or so I'd reason). Anyway, if you created your db with -parse_seqids and the descriptions are stored there in the headers, you can just take the hits from your blast and retrieve the sequences and descriptions with blastdbcmd.

              cut -f 2 yourTabOutput | sort -u > id_list (column 2 holds the subject IDs in -outfmt 6; column 1 holds the query IDs)
              blastdbcmd -entry_batch id_list ..etc
              Last edited by rhinoceros; 07-03-2013, 03:42 AM.
              savetherhino.org
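
The two steps above can be sketched end to end as follows. This is an illustration under stated assumptions, not a tested pipeline: in `-outfmt 6` the subject (database) ID sits in column 2, the sample rows and file names are invented, and `fetch_hits_from_db` is a hypothetical wrapper that is defined but not run here, because it needs BLAST+ installed and a database built with `-parse_seqids`.

```shell
# Hypothetical sample of BLAST -outfmt 6 output (tab-separated, 12 standard columns).
demo_dir=$(mktemp -d)
hits="$demo_dir/hits.tsv"
printf '%s\t%s\t%s\t%s\t%s\t%s\t%s\t%s\t%s\t%s\t%s\t%s\n' \
    regionA subj2 98.5 200 3 0 1 200 10 209 1e-90 350 \
    regionA subj1 98.5 200 3 0 1 200 50 249 1e-90 350 \
    regionB subj2 95.0 100 5 0 1 100 1 100 1e-30 120 \
    regionB subj5 99.0 100 1 0 1 100 1 100 1e-45 180 \
    > "$hits"

# Column 2 is the subject sequence ID; sort -u removes duplicate IDs.
unique_subject_ids() {
    cut -f 2 "$1" | sort -u
}

unique_subject_ids "$hits" > "$demo_dir/id_list"
cat "$demo_dir/id_list"

# Retrieve the listed entries from a BLAST database. With no -outfmt given,
# blastdbcmd writes FASTA records, so the deflines carry the descriptions.
# $1 = database name (hypothetical), $2 = file with one ID per line.
fetch_hits_from_db() {
    blastdbcmd -db "$1" -entry_batch "$2"
}
# Example call (database name is made up):
#   fetch_hits_from_db my_custom_db "$demo_dir/id_list" > hits.fasta
```

The ID list for the sample rows contains subj1, subj2 and subj5, one per line.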

              • #8
                Thank you once again for your response rhinoceros. I will go through it right now and check the option you suggested.
