reference sequence naming

smarkel

Junior Member

Join Date: Sep 2009

Posts: 7
- Share
- Tweet
#1

reference sequence naming

10-27-2009, 02:56 PM

We're currently grappling with the reference sequence name synonym issue. When we use mapping results or assembly results from different programs in conjunction with features from yet other sources, we find that we have to allow for a single reference sequence to have multiple names (IDs). Reference sequences may already exist (mapping) or may be generated for contigs (assembly). The result is that we have mapped reads with reference sequence IDs that we need to either convert to a standard set of IDs before adding the reads to BAM files or we need to use a synonym table and perform multiple BAM file queries according to the number of different names (IDs) a reference sequence might have. Using GFF files gives analogous problems.

How have others dealt with this? Should we just get used to the idea of always having perform a sed-like editing task in order to ensure a common naming convention for the reference sequences as referred to by mapped reads and features?
Tags: None

Previous template Next

Topics	Statistics	Last Post
A Close Examination at Probiotic-Related Bacteremia by seqadmin Started by seqadmin, Today, 08:06 AM	0 responses 11 views 0 likes	Last Post by seqadmin Today, 08:06 AM
Expanded Genetic Insights into Blood Pressure Regulation by seqadmin Started by seqadmin, 04-30-2024, 12:17 PM	0 responses 13 views 0 likes	Last Post by seqadmin 04-30-2024, 12:17 PM
The Role of Enhancers in Defining Cell Fate by seqadmin Started by seqadmin, 04-29-2024, 10:49 AM	0 responses 19 views 0 likes	Last Post by seqadmin 04-29-2024, 10:49 AM
Expanding the Horizons of Cellular Research with the Single Cell Atlas by seqadmin Started by seqadmin, 04-25-2024, 11:49 AM	0 responses 26 views 0 likes	Last Post by seqadmin 04-25-2024, 11:49 AM

Seqanswers Leaderboard Ad