Header Leaderboard Ad

Collapse

help recovering data from bam w/ bad header

Collapse

Announcement

Collapse
No announcement yet.
X
 
  • Filter
  • Time
  • Show
Clear All
new posts

  • help recovering data from bam w/ bad header

    I have a bam file that I need to work with to reproduce an issue for some troubleshooting. This bam file was left by a student in a collaborator's lab and we pretty much only have this bam file (no upstream or downstream derivatives of the file). Samtools view gives this error message:

    [bam_header_read] invalid BAM binary header (this is not a BAM file).
    [main_samview] fail to read the header from "7_2.bam".

    Picard's ValidateSamFile produces only the message:

    ERROR: Read groups is empty
    SAMFormatException on record 01

    along with STDERR that looks like the java code is erroring out. Bamtools stats gives this error:

    bamtools stats ERROR: could not open input BAM file(s)... Aborting.


    This file is of a size that would be appropriate for the mapping done, and somehow the person who created it used it. It is a rather old copy that has now been passed around a bit but if it was truncated from a transfer I'd have expected different errors (I think...).

    Is there some way I can dump just the data from a bam file without dealing with the bad header? Or is there some way I can read in a generic 'dummy' header over the bad one to save the data?


    Thanks,
    John

  • #2
    How's your C programming? I suspect you're going to have to code something with htslib to figure out (A) where the problem is and (B) get around it as best as possible.

    Comment


    • #3
      I've no experience with C coding, I mainly stick to scripting languages (perl & python). But I'll take a look at the htslib methods and see if I can muddle through.

      Thanks,
      John

      Comment


      • #5
        I'll try the GATK ValidateSamFile and see what it says. GATK tools usually do give helpful error messages and I haven't tried that one yet.

        Thanks,
        John

        Comment

        Latest Articles

        Collapse

        • seqadmin
          How RNA-Seq is Transforming Cancer Studies
          by seqadmin



          Cancer research has been transformed through numerous molecular techniques, with RNA sequencing (RNA-seq) playing a crucial role in understanding the complexity of the disease. Maša Ivin, Ph.D., Scientific Writer at Lexogen, and Yvonne Goepel Ph.D., Product Manager at Lexogen, remarked that “The high-throughput nature of RNA-seq allows for rapid profiling and deep exploration of the transcriptome.” They emphasized its indispensable role in cancer research, aiding in biomarker...
          09-07-2023, 11:15 PM
        • seqadmin
          Methods for Investigating the Transcriptome
          by seqadmin




          Ribonucleic acid (RNA) represents a range of diverse molecules that play a crucial role in many cellular processes. From serving as a protein template to regulating genes, the complex processes involving RNA make it a focal point of study for many scientists. This article will spotlight various methods scientists have developed to investigate different RNA subtypes and the broader transcriptome.

          Whole Transcriptome RNA-seq
          Whole transcriptome sequencing...
          08-31-2023, 11:07 AM

        ad_right_rmr

        Collapse

        News

        Collapse

        Topics Statistics Last Post
        Started by seqadmin, 09-22-2023, 09:05 AM
        0 responses
        14 views
        0 likes
        Last Post seqadmin  
        Started by seqadmin, 09-21-2023, 06:18 AM
        0 responses
        12 views
        0 likes
        Last Post seqadmin  
        Started by seqadmin, 09-20-2023, 09:17 AM
        0 responses
        13 views
        0 likes
        Last Post seqadmin  
        Started by seqadmin, 09-19-2023, 09:23 AM
        0 responses
        28 views
        0 likes
        Last Post seqadmin  
        Working...
        X