Is anyone aware of a conversion process to take gsMapper output (newbler) and turn it into the more standard SAM/BAM format?
Seqanswers Leaderboard Ad
Collapse
Announcement
Collapse
No announcement yet.
X
-
The development version of bank2contig can take the -S option to output an AMOS bank as SAM. To get from gsMapper to an AMOS bank you'll have to use the outputted .ace file that gsMapper gives you. I haven't gotten it to to work properly though as the resulting SAM file doesn't seem to contain the base qualities.
I'm seriously considering scripting it myself if Roche doesn't deliver soon
Comment
-
I've scripted a GS Mapper to SAM format converter in Ruby.
You can find it here:
At the moment it only handles single end reads.
The code might be a bit ugly as I used this as an educational project to get into Ruby. Instructions can be found in the readme.
Caveat: The sam file compiles to bam, but I haven't thoroughly checked it for errors, so there might still be some bugs. Please feel free to report any if you see them
Comment
-
Asifrim,
The bank2contig wtih the -S option doesn't give a SAM format. According to the documentation online, conversion to SAM should be using the -s option but as of version 2.0.8 that is not available. The -S option in version 2.0.8 give a Simple Layout Style.
I also tried your Ruby script GstoSam but it truncated prematurely.
MQ.
Comment
-
Hi MQ,
I seem to have mistyped it, it should have have been downcase: -s for SAM
(http://sourceforge.net/apps/mediawik...le=Bank2contig)
Did the ruby script return any kind of error? What version of Ruby are you using? Are you outputting the standard GS mapper output or the tabbed version of the output? If I remember correctly the script works on the default output type (which is the multiline output per read).
Comment
-
Originally posted by asifrim View PostI've scripted a GS Mapper to SAM format converter in Ruby.
You can find it here:
At the moment it only handles single end reads.
The code might be a bit ugly as I used this as an educational project to get into Ruby. Instructions can be found in the readme.
Caveat: The sam file compiles to bam, but I haven't thoroughly checked it for errors, so there might still be some bugs. Please feel free to report any if you see them
So I am using your script on gsMapper output. It seems to work to a first approximation. (Which is more than I can say for any of the other methods I have tried...) But it does not seem like I am quite there. Your script creates a .sam file -- but there is no header created. The sam specification claims http://samtools.sourceforge.net/SAM1.pdf
The SAM format consists of one header section and one alignment section.The whole header section can be absent, but keeping the header is recommended.
Code:samtools view -b Synechocystis-1.sam [bam_header_read] EOF marker is absent. [main_samview] fail to read the header.
Version: 0.1.7 (r510)
Is there a reason why for my purposes a header section is required? Perhaps because I was running gsMapper, rather than gsAssembler?
--
Phillip
PS Yes, I agree Roche programs should offer SAM/BAM output as well as .ace
Comment
Latest Articles
Collapse
-
by seqadmin
Non-coding RNAs (ncRNAs) do not code for proteins but play important roles in numerous cellular processes including gene silencing, developmental pathways, and more. There are numerous types including microRNA (miRNA), long ncRNA (lncRNA), circular RNA (circRNA), and more. In this article, we discuss innovative ncRNA research and explore recent technological advancements that improve the study of ncRNAs.
[Article Coming Soon!]...-
Channel: Articles
Today, 08:07 AM -
-
by seqadmin
Metagenomics has improved the way researchers study microorganisms across diverse environments. Historically, studying microorganisms relied on culturing them in the lab, a method that limits the investigation of many species since most are unculturable1. Metagenomics overcomes these issues by allowing the study of microorganisms regardless of their ability to be cultured or the environments they inhabit. Over time, the field has evolved, especially with the advent...-
Channel: Articles
09-23-2024, 06:35 AM -
-
by seqadmin
During the COVID-19 pandemic, scientists observed that while some individuals experienced severe illness when infected with SARS-CoV-2, others were barely affected. These disparities left researchers and clinicians wondering what causes the wide variations in response to viral infections and what role genetics plays.
Jean-Laurent Casanova, M.D., Ph.D., Professor at Rockefeller University, is a leading expert in this crossover between genetics and infectious...-
Channel: Articles
09-09-2024, 10:59 AM -
ad_right_rmr
Collapse
News
Collapse
Topics | Statistics | Last Post | ||
---|---|---|---|---|
Started by seqadmin, 10-02-2024, 04:51 AM
|
0 responses
13 views
0 likes
|
Last Post
by seqadmin
10-02-2024, 04:51 AM
|
||
Started by seqadmin, 10-01-2024, 07:10 AM
|
0 responses
23 views
0 likes
|
Last Post
by seqadmin
10-01-2024, 07:10 AM
|
||
Started by seqadmin, 09-30-2024, 08:33 AM
|
1 response
30 views
0 likes
|
Last Post
by EmiTom
Today, 06:46 AM
|
||
Started by seqadmin, 09-26-2024, 12:57 PM
|
0 responses
19 views
0 likes
|
Last Post
by seqadmin
09-26-2024, 12:57 PM
|
Comment