Header Leaderboard Ad

Collapse

Convert merged BAM back to per lane BAM or FASTQ file

Collapse

Announcement

Collapse
No announcement yet.
X
 
  • Filter
  • Time
  • Show
Clear All
new posts

  • Convert merged BAM back to per lane BAM or FASTQ file

    I have got some illumina data in a a merged BAM file for a particular sample and I would like to convert it back to per lane data files (BAM or fastq) so it would be compatible with the analysis pipeline. Any ideas on the best way to do this?

  • #2
    When I did this a few days ago (with only knowledge of command-line stuff and samtools), here's a rough guide to what I did:
    1. Extract header from BAM file
      Code:
      samtools view -H infile.bam > header.sam
    2. filter out lanes from BAM file, add header
      Code:
      (cat header.sam; samtools view infile.bam | grep '^DATA...._<lane>_.') > out_lane<lane>.sam
    3. convert SAM to BAM
      Code:
      samtools view -S out_lane<lane>.sam -b > out_lane<lane>.bam


    There's probably some better way, but it would have taken longer for me to find than for me to just do it and deal with the additional overhead of storing temporary SAM files per lane.

    Comment


    • #3
      If the lane info is stored in RG tags you can use this: http://genome.sph.umich.edu/wiki/SplitBam

      Comment


      • #4
        SplitBam looks ideal, but I can't find a download link for it. Any ideas?

        And thanks for the other suggestion too, that makes a lot of sense.

        --EDIT--
        Actually it looks like there are no @RG in the headers ... Oh Dear.
        Last edited by danielsbrewer; 09-29-2011, 01:03 AM.

        Comment


        • #5
          SplitBam download link?

          I am trying to do the exact same thing and I do have @RG tags, but I can not find a download link for SplitBam. Does anyone know how to download this program?

          Jason

          Comment


          • #6
            Originally posted by danielsbrewer View Post
            SplitBam looks ideal, but I can't find a download link for it. Any ideas?

            And thanks for the other suggestion too, that makes a lot of sense.

            --EDIT--
            Actually it looks like there are no @RG in the headers ... Oh Dear.
            Originally posted by jtladner View Post
            I am trying to do the exact same thing and I do have @RG tags, but I can not find a download link for SplitBam. Does anyone know how to download this program?

            Jason
            Go here and either follow the instructions for retrieving the current development release from the source repository or download the tarball for the latest stable realese (0.1.4). Note that either way you are downloading the entire libStatGen toolkit, which you must then compile. SplitBam is merely one of the programs in the toolkit.

            Comment


            • #7
              Originally posted by jtladner View Post
              I am trying to do the exact same thing and I do have @RG tags, but I can not find a download link for SplitBam. Does anyone know how to download this program?

              Jason


              splitBam is part of the bamUtil package, which i found it here https://github.com/Homebrew/homebrew-science and it was very easy to download it.

              Comment

              Latest Articles

              Collapse

              • seqadmin
                How RNA-Seq is Transforming Cancer Studies
                by seqadmin



                Cancer research has been transformed through numerous molecular techniques, with RNA sequencing (RNA-seq) playing a crucial role in understanding the complexity of the disease. Maša Ivin, Ph.D., Scientific Writer at Lexogen, and Yvonne Goepel Ph.D., Product Manager at Lexogen, remarked that “The high-throughput nature of RNA-seq allows for rapid profiling and deep exploration of the transcriptome.” They emphasized its indispensable role in cancer research, aiding in biomarker...
                09-07-2023, 11:15 PM
              • seqadmin
                Methods for Investigating the Transcriptome
                by seqadmin




                Ribonucleic acid (RNA) represents a range of diverse molecules that play a crucial role in many cellular processes. From serving as a protein template to regulating genes, the complex processes involving RNA make it a focal point of study for many scientists. This article will spotlight various methods scientists have developed to investigate different RNA subtypes and the broader transcriptome.

                Whole Transcriptome RNA-seq
                Whole transcriptome sequencing...
                08-31-2023, 11:07 AM

              ad_right_rmr

              Collapse

              News

              Collapse

              Topics Statistics Last Post
              Started by seqadmin, Today, 07:42 AM
              0 responses
              10 views
              0 likes
              Last Post seqadmin  
              Started by seqadmin, 09-22-2023, 09:05 AM
              0 responses
              23 views
              0 likes
              Last Post seqadmin  
              Started by seqadmin, 09-21-2023, 06:18 AM
              0 responses
              16 views
              0 likes
              Last Post seqadmin  
              Started by seqadmin, 09-20-2023, 09:17 AM
              0 responses
              16 views
              0 likes
              Last Post seqadmin  
              Working...
              X