Seqanswers Leaderboard Ad

Collapse

Announcement

Collapse
No announcement yet.
X
 
  • Filter
  • Time
  • Show
Clear All
new posts

  • help recovering data from bam w/ bad header

    I have a bam file that I need to work with to reproduce an issue for some troubleshooting. This bam file was left by a student in a collaborator's lab and we pretty much only have this bam file (no upstream or downstream derivatives of the file). Samtools view gives this error message:

    [bam_header_read] invalid BAM binary header (this is not a BAM file).
    [main_samview] fail to read the header from "7_2.bam".

    Picard's ValidateSamFile produces only the message:

    ERROR: Read groups is empty
    SAMFormatException on record 01

    along with STDERR that looks like the java code is erroring out. Bamtools stats gives this error:

    bamtools stats ERROR: could not open input BAM file(s)... Aborting.


    This file is of a size that would be appropriate for the mapping done, and somehow the person who created it used it. It is a rather old copy that has now been passed around a bit but if it was truncated from a transfer I'd have expected different errors (I think...).

    Is there some way I can dump just the data from a bam file without dealing with the bad header? Or is there some way I can read in a generic 'dummy' header over the bad one to save the data?


    Thanks,
    John

  • #2
    How's your C programming? I suspect you're going to have to code something with htslib to figure out (A) where the problem is and (B) get around it as best as possible.

    Comment


    • #3
      I've no experience with C coding, I mainly stick to scripting languages (perl & python). But I'll take a look at the htslib methods and see if I can muddle through.

      Thanks,
      John

      Comment


      • #5
        I'll try the GATK ValidateSamFile and see what it says. GATK tools usually do give helpful error messages and I haven't tried that one yet.

        Thanks,
        John

        Comment

        Latest Articles

        Collapse

        • seqadmin
          Recent Advances in Sequencing Analysis Tools
          by seqadmin


          The sequencing world is rapidly changing due to declining costs, enhanced accuracies, and the advent of newer, cutting-edge instruments. Equally important to these developments are improvements in sequencing analysis, a process that converts vast amounts of raw data into a comprehensible and meaningful form. This complex task requires expertise and the right analysis tools. In this article, we highlight the progress and innovation in sequencing analysis by reviewing several of the...
          Yesterday, 07:48 AM
        • seqadmin
          Essential Discoveries and Tools in Epitranscriptomics
          by seqadmin




          The field of epigenetics has traditionally concentrated more on DNA and how changes like methylation and phosphorylation of histones impact gene expression and regulation. However, our increased understanding of RNA modifications and their importance in cellular processes has led to a rise in epitranscriptomics research. “Epitranscriptomics brings together the concepts of epigenetics and gene expression,” explained Adrien Leger, PhD, Principal Research Scientist...
          04-22-2024, 07:01 AM

        ad_right_rmr

        Collapse

        News

        Collapse

        Topics Statistics Last Post
        Started by seqadmin, Today, 06:57 AM
        0 responses
        9 views
        0 likes
        Last Post seqadmin  
        Started by seqadmin, Yesterday, 07:17 AM
        0 responses
        14 views
        0 likes
        Last Post seqadmin  
        Started by seqadmin, 05-02-2024, 08:06 AM
        0 responses
        19 views
        0 likes
        Last Post seqadmin  
        Started by seqadmin, 04-30-2024, 12:17 PM
        0 responses
        24 views
        0 likes
        Last Post seqadmin  
        Working...
        X