Header Leaderboard Ad

Collapse

Combine two FASTA files

Collapse

Announcement

Collapse

SEQanswers June Challenge Has Begun!

The competition has begun! We're giving away a $50 Amazon gift card to the member who answers the most questions on our site during the month. We want to encourage our community members to share their knowledge and help each other out by answering questions related to sequencing technologies, genomics, and bioinformatics. The competition is open to all members of the site, and the winner will be announced at the beginning of July. Best of luck!

For a list of the official rules, visit (https://www.seqanswers.com/forum/sit...wledge-and-win)
See more
See less
X
 
  • Filter
  • Time
  • Show
Clear All
new posts

  • Combine two FASTA files

    I have a couple of fasta files I want to combine, two and two. They're >1 GB so reading them in R is too time consuming. I'm running a Mac on which I can't get Biopython to work and I don't find any tool for this specific purpose on Galaxy.

    Does anybody have any suggestion/solution for my problem?

  • #2
    Can't you just concatenate the two files using cat if there are no duplicates?

    Comment


    • #3
      I was also going to suggest the command line tool cat, e.g.

      Code:
      cat file1.fasta file2.fasta > combined.fasta
      Biopython should work fine on the Mac - using the Apple provided Python, the Python.org version, etc. Please seek help on our mailing list, or start a new thread here.

      Comment


      • #4
        vivek_: Thanks, that worked quite beautifully. I would never have expected the solution to be that trivial.

        maubp: Yes, Biopython should work but it's not too easy to install NumPy and SciPy properly, I've found. I'm no terminal wizz.

        Comment


        • #5
          Biopython doesn't need SciPy, and (on some versions of Mac OS at least) NumPy is already provided by Apple (although not the latest version). You can also install Biopython without NumPy (thins like PDB 3D structures won't work).

          Comment


          • #6
            Hi Calici

            Try SeqNinja from DNASTAR
            Bioinformatics software innovators since 1984, DNASTAR Lasergene provides a complete software solution for life science researchers.

            Comment

            Latest Articles

            Collapse

            ad_right_rmr

            Collapse

            News

            Collapse

            Topics Statistics Last Post
            Started by seqadmin, Yesterday, 01:08 PM
            0 responses
            6 views
            0 likes
            Last Post seqadmin  
            Started by seqadmin, 06-01-2023, 08:56 PM
            0 responses
            12 views
            0 likes
            Last Post seqadmin  
            Started by seqadmin, 06-01-2023, 07:33 AM
            0 responses
            128 views
            0 likes
            Last Post seqadmin  
            Started by seqadmin, 05-31-2023, 07:50 AM
            0 responses
            166 views
            0 likes
            Last Post seqadmin  
            Working...
            X