Seqanswers Leaderboard Ad

Collapse
X
 
  • Filter
  • Time
  • Show
Clear All
new posts
  • cmnailecin
    Junior Member
    • Dec 2012
    • 4

    Newbie: extract subsequence

    Hello,

    I am new here and I was looking through previous posts on the topic which I fully intend to try. I have been using a program called fastahack (download here... https://github.com/ekg/fastahack) to extract subsequences from a fasta file (whole genome).

    IT has been working very well and is fairly simple to use. However, I need to extract multiple (more than 1000) subsequences so doing this manually would be out of the question.

    I would like to know if there is a way to use this program but have it extract multiple subsequences from a fasta file with different start and stop positions. I would like to use this same method on a different genome with, again, different start and stop positions so a 'generic' solution would be necessary.

    Please help if you can

    * perl script, incrementing loop??? edit program coding? batch files???
  • maasha
    Senior Member
    • Apr 2009
    • 153

    #2
    This question has been asked in this forum before IIRC. You can use Biopieces (www.biopieces.org) to solve this. Have a look at get_seq.

    Comment

    • Jeremy
      Senior Member
      • Nov 2009
      • 190

      #3
      you can do it using the seqinr R package for R quite easily with the getFrag command, but you need to know some R basics. If you do try this way make sure you do not set as.string to TRUE otherwise R needs to convert everything back to character first and takes twice as long.

      Comment

      • volks
        Member
        • Jun 2010
        • 80

        #4
        fastaFromBed from bedtools?

        Comment

        • cmnailecin
          Junior Member
          • Dec 2012
          • 4

          #5
          Thank you all! At this point I am going to be using get_seq from biopieces. The only downfall to this is the many prerequisites but I'm working through them. Thank you very much maasha.

          Comment

          Latest Articles

          Collapse

          • seqadmin
            Pathogen Surveillance with Advanced Genomic Tools
            by seqadmin




            The COVID-19 pandemic highlighted the need for proactive pathogen surveillance systems. As ongoing threats like avian influenza and newly emerging infections continue to pose risks, researchers are working to improve how quickly and accurately pathogens can be identified and tracked. In a recent SEQanswers webinar, two experts discussed how next-generation sequencing (NGS) and machine learning are shaping efforts to monitor viral variation and trace the origins of infectious...
            03-24-2025, 11:48 AM
          • seqadmin
            New Genomics Tools and Methods Shared at AGBT 2025
            by seqadmin


            This year’s Advances in Genome Biology and Technology (AGBT) General Meeting commemorated the 25th anniversary of the event at its original venue on Marco Island, Florida. While this year’s event didn’t include high-profile musical performances, the industry announcements and cutting-edge research still drew the attention of leading scientists.

            The Headliner
            The biggest announcement was Roche stepping back into the sequencing platform market. In the years since...
            03-03-2025, 01:39 PM

          ad_right_rmr

          Collapse

          News

          Collapse

          Topics Statistics Last Post
          Started by seqadmin, 03-20-2025, 05:03 AM
          0 responses
          41 views
          0 reactions
          Last Post seqadmin  
          Started by seqadmin, 03-19-2025, 07:27 AM
          0 responses
          51 views
          0 reactions
          Last Post seqadmin  
          Started by seqadmin, 03-18-2025, 12:50 PM
          0 responses
          38 views
          0 reactions
          Last Post seqadmin  
          Started by seqadmin, 03-03-2025, 01:15 PM
          0 responses
          193 views
          0 reactions
          Last Post seqadmin  
          Working...