Seqanswers Leaderboard Ad

Collapse

Announcement

Collapse
No announcement yet.
X
 
  • Filter
  • Time
  • Show
Clear All
new posts

  • How to Extract Multiple Sequence from Multi Fasta File by ID list

    Hi,
    I have a list of ids in .txt format and a multi fasta file with sequences. I need to extract sequences with the IDs in the list.

    Can you help me, please?

  • #2
    I think you can do that using seqret which is part of EMBOSS. According to the documentation the paramater -iquery1 can be used to specify a list of IDs, although probably not a file with IDs...

    Comment


    • #3
      Do you program? You can do that with a few lines using a library like Biopython.

      Alternatively, if you have a local Galaxy you could ask your admin to install one of these tools: http://toolshed.g2.bx.psu.edu/view/p...q_filter_by_id or http://toolshed.g2.bx.psu.edu/view/p...q_select_by_id

      Comment


      • #4
        If there are no linebreaks in the sequences, then

        Code:
        grep -A1 -w -f id.txt seqFile.fasta > output.fasta
        should work. The ids have to be identical to the fasta headers including the greater than sign.
        savetherhino.org

        Comment


        • #5
          faSomeRecords from Kent utilities is the simplest solution (http://hgdownload.soe.ucsc.edu/admin/exe/linux.x86_64/)

          More here: http://seqanswers.com/forums/showpos...0&postcount=13

          Comment

          Latest Articles

          Collapse

          • seqadmin
            Non-Coding RNA Research and Technologies
            by seqadmin




            Non-coding RNAs (ncRNAs) do not code for proteins but play important roles in numerous cellular processes including gene silencing, developmental pathways, and more. There are numerous types including microRNA (miRNA), long ncRNA (lncRNA), circular RNA (circRNA), and more. In this article, we discuss innovative ncRNA research and explore recent technological advancements that improve the study of ncRNAs.

            Nobel Prize for MicroRNA Discovery
            This week,...
            10-07-2024, 08:07 AM
          • seqadmin
            Recent Developments in Metagenomics
            by seqadmin





            Metagenomics has improved the way researchers study microorganisms across diverse environments. Historically, studying microorganisms relied on culturing them in the lab, a method that limits the investigation of many species since most are unculturable1. Metagenomics overcomes these issues by allowing the study of microorganisms regardless of their ability to be cultured or the environments they inhabit. Over time, the field has evolved, especially with the advent...
            09-23-2024, 06:35 AM

          ad_right_rmr

          Collapse

          News

          Collapse

          Topics Statistics Last Post
          Started by seqadmin, Today, 06:35 AM
          0 responses
          6 views
          0 likes
          Last Post seqadmin  
          Started by seqadmin, Yesterday, 02:44 PM
          0 responses
          7 views
          0 likes
          Last Post seqadmin  
          Started by seqadmin, 10-11-2024, 06:55 AM
          0 responses
          14 views
          0 likes
          Last Post seqadmin  
          Started by seqadmin, 10-02-2024, 04:51 AM
          0 responses
          110 views
          0 likes
          Last Post seqadmin  
          Working...
          X