fasta group by sequence

You are currently viewing the SEQanswers forums as a guest, which limits your access. Click here to register now, and join the discussion

NicoBxl

not just another member

Join Date: Aug 2010

Posts: 264
- Share
- Tweet
#1

fasta group by sequence

06-21-2011, 11:43 PM

Hi,

How is the fastest way to parse a fasta file to produce an another fasta file where the same sequence are grouped and the number occurences are in the id (id_occurences).

ex:

Code:

>seq1 ATGCATGC >seq2 ATGCCCCC >seq3 ATGCATGC >seq4 ATGCGGGG >seq5 ATGCCCCC >seq6 ATGCATGC >seq7 ATGCAAAA >seq8 ATGCGGGG

Result :

Code:

>seq1_3 ATGCATGC >seq2_2 ATGCCCCC >seq3_2 ATGCGGGG >seq4_1 ATGCAAAA

Thanks in advance,

N.
Tags: None
NicoBxl

not just another member

Join Date: Aug 2010

Posts: 264
- Share
- Tweet
#2

06-21-2011, 11:50 PM

found in fastx
Comment

Previous template Next

Topics	Statistics	Last Post
Mechanical Forces in DNA Transcription Uncovered by Clemson Researchers by seqadmin Started by seqadmin, 10-02-2024, 04:51 AM	0 responses 13 views 0 likes	Last Post by seqadmin 10-02-2024, 04:51 AM
New Epigenetic Clock Links Cheek Cells to Mortality Risk by seqadmin Started by seqadmin, 10-01-2024, 07:10 AM	0 responses 21 views 0 likes	Last Post by seqadmin 10-01-2024, 07:10 AM
AI-Powered Blood Test Shows Promise for Early Ovarian Cancer Detection by seqadmin Started by seqadmin, 09-30-2024, 08:33 AM	0 responses 25 views 0 likes	Last Post by seqadmin 09-30-2024, 08:33 AM
Stem Cell Research Suggests Human Cells May Enter Developmental Pause by seqadmin Started by seqadmin, 09-26-2024, 12:57 PM	0 responses 18 views 0 likes	Last Post by seqadmin 09-26-2024, 12:57 PM

Working...

Seqanswers Leaderboard Ad