Header Leaderboard Ad

Collapse

converting fasta dna files to protein

Collapse

Announcement

Collapse
No announcement yet.
X
 
  • Filter
  • Time
  • Show
Clear All
new posts

  • converting fasta dna files to protein

    Hello,

    I have 200 human fasta dna files from a region of chr6. Each sequence is 5,500 bp each. I've combined these fasta files and uploaded them into Clustal Omega to generate multiple sequence alignments and phylogenetic tree.

    It worked well, however, I would like to convert these sequences into protein and highlights epitopes present in the sequences. What is the best tool to used for this purpose? What format do I need to select for the output?

    Thank you so much for your advice.

  • #2
    I've used transeq from the EMBOSS suite:

    https://www.ebi.ac.uk/Tools/st/emboss_transeq/

    There's a command-line version as well, as part of the free EMBOSS toolkit that is available in most Linux distributions:

    http://emboss.sourceforge.net/download/

    Comment


    • #3
      I've also written a nifty tool for nucleotide -> amino translation, but that does not appear to be the goal in this case. Rather, it appears that there are multiple small sequences in a 5500bp region that need to be translated with correct frames. To do so, you need to know the boundaries of these short peptide-encoding sequences, and ignore the non-coding sequence. I don't know of a tool which does that.

      Comment


      • #4
        Cross-posted: https://www.biostars.org/p/239422/

        Has been answered there.

        Comment


        • #5
          So far, I've tried EMBOS Transeq and the run aborted before generating anyting for some reason. I've combined multiple fasta files (size was under 1 MB). It worked when I tried it with a really small fasta file. I'm not sure what the problem is?

          I've also tried ExPasy tool and it generated an output on the screen but it's not clear to me how to download the result. Also, my final goal is to import the result into Culstal Omega to do the alignment so the format of the output has to be compatible. Should I stick with ExPasy?

          Thanks

          Comment


          • #6
            Originally posted by HLAgroupLK View Post
            I've also tried ExPasy tool and it generated an output on the screen but it's not clear to me how to download the result. Also, my final goal is to import the result into Culstal Omega to do the alignment so the format of the output has to be compatible. Should I stick with ExPasy?

            Thanks
            Option 1: Highlight and copy/paste the result data into a separate text file (I assume result is already in fasta format). Be sure to save the file in text format.

            Option 2: Choose "file" --> "Save Page as" from your browser window. Be sure to select format as "text file" for the file being saved.

            First option may be cleaner. You can then open the file in MEGA or upload to Clustal Omega.

            Comment

            Latest Articles

            Collapse

            • seqadmin
              Improved Targeted Sequencing: A Comprehensive Guide to Amplicon Sequencing
              by seqadmin



              Amplicon sequencing is a targeted approach that allows researchers to investigate specific regions of the genome. This technique is routinely used in applications such as variant identification, clinical research, and infectious disease surveillance. The amplicon sequencing process begins by designing primers that flank the regions of interest. The DNA sequences are then amplified through PCR (typically multiplex PCR) to produce amplicons complementary to the targets. RNA targets...
              03-21-2023, 01:49 PM
            • seqadmin
              Targeted Sequencing: Choosing Between Hybridization Capture and Amplicon Sequencing
              by seqadmin




              Targeted sequencing is an effective way to sequence and analyze specific genomic regions of interest. This method enables researchers to focus their efforts on their desired targets, as opposed to other methods like whole genome sequencing that involve the sequencing of total DNA. Utilizing targeted sequencing is an attractive option for many researchers because it is often faster, more cost-effective, and only generates applicable data. While there are many approaches...
              03-10-2023, 05:31 AM

            ad_right_rmr

            Collapse

            News

            Collapse

            Topics Statistics Last Post
            Started by seqadmin, Yesterday, 01:40 PM
            0 responses
            8 views
            0 likes
            Last Post seqadmin  
            Started by seqadmin, 03-29-2023, 11:44 AM
            0 responses
            12 views
            0 likes
            Last Post seqadmin  
            Started by seqadmin, 03-24-2023, 02:45 PM
            0 responses
            20 views
            0 likes
            Last Post seqadmin  
            Started by seqadmin, 03-22-2023, 12:26 PM
            0 responses
            28 views
            0 likes
            Last Post seqadmin  
            Working...
            X