Header Leaderboard Ad

Collapse

gsMapper to SAM/BAM

Collapse

Announcement

Collapse
No announcement yet.
X
 
  • Filter
  • Time
  • Show
Clear All
new posts

  • gsMapper to SAM/BAM

    Is anyone aware of a conversion process to take gsMapper output (newbler) and turn it into the more standard SAM/BAM format?

  • #2
    You can try using the bank2contig tool of the CVS version of the AMOS package. You can use the toAmos tool to convert the ace to an Amos bank which is the input of the bank2contig tool.
    Last edited by asifrim; 02-11-2010, 04:36 AM.

    Comment


    • #3
      I am confused. bank2contig gives sam format?

      I am also interested in converting gsmapper output to sam format. any suggestions?
      --
      bioinfosm

      Comment


      • #4
        The development version of bank2contig can take the -S option to output an AMOS bank as SAM. To get from gsMapper to an AMOS bank you'll have to use the outputted .ace file that gsMapper gives you. I haven't gotten it to to work properly though as the resulting SAM file doesn't seem to contain the base qualities.

        I'm seriously considering scripting it myself if Roche doesn't deliver soon

        Comment


        • #5
          I've scripted a GS Mapper to SAM format converter in Ruby.
          You can find it here:

          http://homes.esat.kuleuven.be/~asifr...am/GstoSam.tgz

          At the moment it only handles single end reads.
          The code might be a bit ugly as I used this as an educational project to get into Ruby. Instructions can be found in the readme.

          Caveat: The sam file compiles to bam, but I haven't thoroughly checked it for errors, so there might still be some bugs. Please feel free to report any if you see them

          Comment


          • #6
            Asifrim,

            The bank2contig wtih the -S option doesn't give a SAM format. According to the documentation online, conversion to SAM should be using the -s option but as of version 2.0.8 that is not available. The -S option in version 2.0.8 give a Simple Layout Style.

            I also tried your Ruby script GstoSam but it truncated prematurely.

            MQ.

            Comment


            • #7
              Hi MQ,
              I seem to have mistyped it, it should have have been downcase: -s for SAM
              (http://sourceforge.net/apps/mediawik...le=Bank2contig)

              Did the ruby script return any kind of error? What version of Ruby are you using? Are you outputting the standard GS mapper output or the tabbed version of the output? If I remember correctly the script works on the default output type (which is the multiline output per read).

              Comment


              • #8
                Originally posted by asifrim View Post
                I've scripted a GS Mapper to SAM format converter in Ruby.
                You can find it here:

                http://homes.esat.kuleuven.be/~asifr...am/GstoSam.tgz

                At the moment it only handles single end reads.
                The code might be a bit ugly as I used this as an educational project to get into Ruby. Instructions can be found in the readme.

                Caveat: The sam file compiles to bam, but I haven't thoroughly checked it for errors, so there might still be some bugs. Please feel free to report any if you see them
                My application: I would like to view the output of a 454 fragment sequence run mapped (resequence) back to the reference genome of the organism sequenced using IGV.

                So I am using your script on gsMapper output. It seems to work to a first approximation. (Which is more than I can say for any of the other methods I have tried...) But it does not seem like I am quite there. Your script creates a .sam file -- but there is no header created. The sam specification claims http://samtools.sourceforge.net/SAM1.pdf

                The SAM format consists of one header section and one alignment section.The whole header section can be absent, but keeping the header is recommended.
                But attempts at converting this sam file to bam for sorting fail. For example:

                Code:
                 samtools view -b Synechocystis-1.sam
                [bam_header_read] EOF marker is absent.
                [main_samview] fail to read the header.
                I am running a fairly recent version of samtools:

                Version: 0.1.7 (r510)

                Is there a reason why for my purposes a header section is required? Perhaps because I was running gsMapper, rather than gsAssembler?

                --
                Phillip

                PS Yes, I agree Roche programs should offer SAM/BAM output as well as .ace

                Comment

                Latest Articles

                Collapse

                • seqadmin
                  A Brief Overview and Common Challenges in Single-cell Sequencing Analysis
                  by seqadmin


                  ​​​​​​The introduction of single-cell sequencing has advanced the ability to study cell-to-cell heterogeneity. Its use has improved our understanding of somatic mutations1, cell lineages2, cellular diversity and regulation3, and development in multicellular organisms4. Single-cell sequencing encompasses hundreds of techniques with different approaches to studying the genomes, transcriptomes, epigenomes, and other omics of individual cells. The analysis of single-cell sequencing data i...

                  01-24-2023, 01:19 PM
                • seqadmin
                  Introduction to Single-Cell Sequencing
                  by seqadmin
                  Single-cell sequencing is a technique used to investigate the genome, transcriptome, epigenome, and other omics of individual cells using high-throughput sequencing. This technology has provided many scientific breakthroughs and continues to be applied across many fields, including microbiology, oncology, immunology, neurobiology, precision medicine, and stem cell research.

                  The advancement of single-cell sequencing began in 2009 when Tang et al. investigated the single-cell transcriptomes
                  ...
                  01-09-2023, 03:10 PM

                ad_right_rmr

                Collapse
                Working...
                X