Seqanswers Leaderboard Ad

Collapse

Announcement

Collapse
No announcement yet.
X
 
  • Filter
  • Time
  • Show
Clear All
new posts

  • Changing header of BAM

    Hello,

    I apologize if my question is redundant....

    I created bam files from fastq files by ELAND_standalone and tried to feed them to GATK. Then I got the following error message.
    ##### ERROR MESSAGE: SAM/BAM file reanalysis.bam is malformed: SAM file doesn't have any read groups defined in the header. The GATK no longer supports SAM files without read groups
    ##### ERROR -

    How can I change the header? Samtools?

    Thank you very much.

    Hiroki

  • #2
    you could easily change the header using samtools, but that won't solve the problem as the readgroup is necessary in each alignment section as well.
    You could have a look in the AddOrReplaceReadGroup Utility of Picard:


    I don't know if that changes the header as well but if not have a look here:

    Comment


    • #3
      Thanks ulz_peter!

      I was told to use Picard from another guy.
      Now I'm trying it.

      Thank you so much anyways!

      Comment


      • #4
        I've run into similar issues using GATK for bam files, and Picard. The option that always has helped me when I knew hte BAM file was "ok,"

        java [args to jvm] GATK.jar [GAKT opts] VALIDATION_STRINGENCY=SILENT

        Comment


        • #5
          Hi, I have the same problem as Hiroki.

          I made a sam file using command:
          Code:
          bwa samse ref.fa my.sai my.fastq > my.sam
          then, I have creaded .bam file:
          Code:
          samtools view -bS my.sam > my.bam
          next, I have sorted my.bam file:
          Code:
          samtools sort my.bam my_sorted
          Using GATK, I need to put a .bam file as an input.
          According to the instructions from http://www.broadinstitute.org/gsa/wi...s_for_the_GATK I have used Picard's ReordereSam, to reorder reads:
          Code:
          java -jar ReordereSam.jar I=/path/my_sorted.bam O=/path/my_reordered.bam R=/path/ref.fa
          Trying i.e.
          Code:
          java -jar GenomeAnalysisTK.jar -T DepthOfCoverage -R /paht/ref.fa -I /path/aln_reordered.bam
          an error is returned:
          HTML Code:
          ##### ERROR MESSAGE: SAM/BAM file /path/my_reordered.bam is malformed: SAM file doesn't have any read groups defined in the header.  The GATK no longer supports SAM files without read groups
          I have tried Picards AddOrReplaceReadGroups.jar and ReplaceSamHeader.jar but it didn't help me.

          Any suggestions how to bulid proper .bam file for GATK?
          thx

          Comment


          • #6
            hi the damian,
            when creating your sam file, you have to add read group header.
            this should work:

            bwa samse -r @RG\tID:IDa\tSM:SM\tPL:Illumina ref.fa my.sai my.fastq > my.sam
            bests

            colin

            Comment


            • #7
              Yes, I've already realised how to do this. Another option is to use Picard's AddOrReplaceReadGropus.jar.
              Code:
              java -jar AddOrReplaceReadGroups I=my.bam O=myGr.bam LB=whatever PL=illumina PU=whatever SM=whatever
              Of course prior using AddOrReplaceReadGroups your .sam file needs to pass Picard's validation (ValidateSamFile.jar)

              Comment

              Latest Articles

              Collapse

              • seqadmin
                Essential Discoveries and Tools in Epitranscriptomics
                by seqadmin


                The field of epigenetics has traditionally concentrated more on DNA and how changes like methylation and phosphorylation of histones impact gene expression and regulation. However, our increased understanding of RNA modifications and their importance in cellular processes has led to a rise in epitranscriptomics research. “Epitranscriptomics brings together the concepts of epigenetics and gene expression,” explained Adrien Leger, PhD, Principal Research Scientist on Modified Bases...
                Yesterday, 07:01 AM
              • seqadmin
                Current Approaches to Protein Sequencing
                by seqadmin


                Proteins are often described as the workhorses of the cell, and identifying their sequences is key to understanding their role in biological processes and disease. Currently, the most common technique used to determine protein sequences is mass spectrometry. While still a valuable tool, mass spectrometry faces several limitations and requires a highly experienced scientist familiar with the equipment to operate it. Additionally, other proteomic methods, like affinity assays, are constrained...
                04-04-2024, 04:25 PM

              ad_right_rmr

              Collapse

              News

              Collapse

              Topics Statistics Last Post
              Started by seqadmin, 04-11-2024, 12:08 PM
              0 responses
              37 views
              0 likes
              Last Post seqadmin  
              Started by seqadmin, 04-10-2024, 10:19 PM
              0 responses
              41 views
              0 likes
              Last Post seqadmin  
              Started by seqadmin, 04-10-2024, 09:21 AM
              0 responses
              35 views
              0 likes
              Last Post seqadmin  
              Started by seqadmin, 04-04-2024, 09:00 AM
              0 responses
              54 views
              0 likes
              Last Post seqadmin  
              Working...
              X