  • jflowers
    Member
    • Oct 2011
    • 42

    SAM specification when 1 read from a pair is missing from the SAM

    Hello,

    After filtering a BAM of paired-end reads, there are frequently orphan reads whose mates are no longer in the BAM file.

    I cannot find any way in which this is tracked in the record for the remaining orphaned read, which now carries information about a mate that no longer exists in the file.

    I tried running Picard's FixMateInformation after removing one read from a pair, and the remaining record was unchanged. I thought the 0x1 flag might get switched to indicate a single-end read, but a more careful reading of the SAM specification shows that 0x1 means the read was "paired in sequencing", not paired in the alignment.

    Is this correct? My concern is that downstream applications that require PE reads (e.g., for structural variant discovery) might not handle BAMs with orphaned reads.
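Since the record itself does not say whether the mate is still in the file, one way to find orphans is to count records per read name. The following is a minimal sketch that works on plain SAM text lines; the function name `find_orphans` and the example records are made up for illustration, and only the FLAG bit values come from the SAM specification.

```python
from collections import Counter

# FLAG bits as defined in the SAM specification
PAIRED = 0x1          # read was paired in sequencing
SECONDARY = 0x100     # secondary alignment
SUPPLEMENTARY = 0x800 # supplementary alignment


def find_orphans(sam_lines):
    """Return read names flagged 0x1 that have only one primary record."""
    counts = Counter()
    for line in sam_lines:
        if line.startswith("@"):       # skip header lines
            continue
        fields = line.rstrip("\n").split("\t")
        qname, flag = fields[0], int(fields[1])
        if not flag & PAIRED:
            continue                   # single-end reads cannot be orphans
        if flag & (SECONDARY | SUPPLEMENTARY):
            continue                   # count primary records only
        counts[qname] += 1
    return sorted(q for q, n in counts.items() if n == 1)


# Hypothetical records: r1 has both mates, r2 has lost its mate.
example = [
    "@HD\tVN:1.6\tSO:coordinate",
    "r1\t99\tchr1\t100\t60\t10M\t=\t300\t210\tACGTACGTAC\tIIIIIIIIII",
    "r1\t147\tchr1\t300\t60\t10M\t=\t100\t-210\tACGTACGTAC\tIIIIIIIIII",
    "r2\t99\tchr1\t500\t60\t10M\t=\t700\t210\tACGTACGTAC\tIIIIIIIIII",
]
print(find_orphans(example))
```

Note that r2 still has 0x1 set and still points at position 700 for its mate, which is exactly the stale information described above.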

    Thanks.

    Jonathan
  • lh3
    Senior Member
    • Feb 2008
    • 686

    #2
    Don't drop reads from the BAM. Turn them into unmapped records instead. Although the spec allows orphans in principle, you will run into trouble elsewhere.
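One possible reading of this advice, sketched on plain SAM text lines: instead of deleting the filtered read, set 0x4 on it and set 0x8 on its mate, so both records stay in the file. The function name `mark_unmapped` and the example records are hypothetical. Placing the unmapped read at its mate's RNAME and POS follows the recommended practice in the SAM specification; clearing 0x2 and zeroing TLEN are assumptions of this sketch, and optional tags such as NM or MD are left untouched even though they become stale.

```python
PROPER_PAIR = 0x2
UNMAPPED = 0x4
MATE_UNMAPPED = 0x8


def mark_unmapped(read_line, mate_line):
    """Return (read, mate) SAM lines with `read` turned into an unmapped record."""
    read = read_line.rstrip("\n").split("\t")
    mate = mate_line.rstrip("\n").split("\t")

    # Read: set 0x4, clear 0x2, and place it at the mate's coordinates.
    read[1] = str((int(read[1]) | UNMAPPED) & ~PROPER_PAIR)
    read[2], read[3] = mate[2], mate[3]   # RNAME, POS copied from the mate
    read[4] = "0"                          # MAPQ
    read[5] = "*"                          # CIGAR
    read[6], read[7] = "=", mate[3]        # RNEXT, PNEXT
    read[8] = "0"                          # TLEN

    # Mate: set 0x8, clear 0x2, and point its mate fields at the same place.
    mate[1] = str((int(mate[1]) | MATE_UNMAPPED) & ~PROPER_PAIR)
    mate[6], mate[7] = "=", mate[3]
    mate[8] = "0"

    return "\t".join(read), "\t".join(mate)


# Hypothetical pair: the reverse-strand read (flag 147) failed a filter.
failed = "r1\t147\tchr1\t300\t60\t10M\t=\t100\t-210\tACGTACGTAC\tIIIIIIIIII"
kept = "r1\t99\tchr1\t100\t60\t10M\t=\t300\t210\tACGTACGTAC\tIIIIIIIIII"
new_read, new_mate = mark_unmapped(failed, kept)
print(new_read)
print(new_mate)
```

Because the unmapped read moves to its mate's position, a coordinate-sorted file would need to be re-sorted afterwards.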
