What'd be the best way to parse a CIGAR string fully according to the specification in C/C++? Would a regular expression work?
Originally posted by tedwong:
What'd be the best way to parse a CIGAR string fully according to the specification in C/C++? Would a regular expression work?
Here's an example in Java that can easily be translated to C++:
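For reference (using the htslib library):
#include <string>
#include <htslib/sam.h>

// `file` is a std::string holding the path to the SAM/BAM file
auto f = sam_open(file.c_str(), "r");
auto h = sam_hdr_read(f);
auto t = bam_init1();
while (sam_read1(f, h, t) >= 0)
{
    // reference name of this record (tid is -1 when the read has no reference)
    auto id = t->core.tid >= 0 ? std::string(h->target_name[t->core.tid]) : std::string("*");
    auto mapped = !(t->core.flag & BAM_FUNMAP);
    const auto cigar = bam_get_cigar(t);
    for (uint32_t k = 0; k < t->core.n_cigar; k++)
    {
        const int op = bam_cigar_op(cigar[k]);    // operation code, e.g. BAM_CMATCH
        const int ol = bam_cigar_oplen(cigar[k]); // operation length
        if (op == BAM_CMATCH || op == BAM_CINS || op == BAM_CDEL)
        {
            // your code, you have the length in ol (eg: 101M -> ol == 101)
        }
    }
}
// release the record, the header and the file handle
bam_destroy1(t);
sam_hdr_destroy(h);
sam_close(f);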
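If you only have the CIGAR as text (for example column 6 of a SAM line) and don't want to link against htslib, a single pass over the string is enough. A regular expression would work for validating the whole string (the SAM specification gives the pattern \*|([0-9]+[MIDNSHP=X])+), but pulling out the individual operations is usually simpler with a hand-written scan. The following is only a minimal sketch: parse_cigar is a hypothetical helper name, not part of htslib, and treating "*" as an empty list and throwing std::invalid_argument on malformed input are my own assumptions.

```cpp
#include <cctype>
#include <cstdint>
#include <stdexcept>
#include <string>
#include <utility>
#include <vector>

// Hypothetical helper (not part of htslib): splits a SAM CIGAR string
// such as "3S10M2D5M" into (length, operation) pairs.
std::vector<std::pair<std::uint32_t, char>> parse_cigar(const std::string& cigar)
{
    std::vector<std::pair<std::uint32_t, char>> ops;
    if (cigar == "*") return ops;            // "*" means the CIGAR is unavailable
    if (cigar.empty()) throw std::invalid_argument("empty CIGAR");

    const std::string valid = "MIDNSHP=X";   // operations allowed by the SAM spec
    std::uint64_t len = 0;
    bool have_digits = false;
    for (char c : cigar)
    {
        if (std::isdigit(static_cast<unsigned char>(c)))
        {
            // accumulate the decimal length that precedes the operation
            len = len * 10 + static_cast<std::uint64_t>(c - '0');
            if (len > UINT32_MAX) throw std::invalid_argument("CIGAR length overflow");
            have_digits = true;
        }
        else
        {
            // an operation must be a known letter and must follow a length
            if (!have_digits || valid.find(c) == std::string::npos)
                throw std::invalid_argument("malformed CIGAR: " + cigar);
            ops.emplace_back(static_cast<std::uint32_t>(len), c);
            len = 0;
            have_digits = false;
        }
    }
    // trailing digits without an operation letter are an error
    if (have_digits) throw std::invalid_argument("CIGAR ends with a length but no operation");
    return ops;
}
```

Calling parse_cigar("3S10M2D5M") gives four pairs: (3,'S'), (10,'M'), (2,'D'), (5,'M'). You can then loop over them the same way as over the htslib ops above.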