  • rcorbett
    Member
    • Sep 2009
    • 29

    samtools flagstat mismatch in stats

    Hi all,
    I have a BAM file created from Illumina alignments with bwa. When I count the aligned reads two different ways, I get two different numbers.

    When I run flagstat:
    samtools flagstat myBam.bam
    I get this many mapped reads
    41721001 mapped (93.82%)

    However when I try to get this on my own with this command:
    samtools view myBam.bam | awk -F '\t' '($3 != "*")' | wc -l
    I get this number
    42283661

    I'm sure I'm doing something wrong, but not sure what. Is it incorrect to think that all reads with a non '*' listed in the third column should be considered aligned?

    thanks for the help!
  • nilshomer
    Nils Homer
    • Nov 2008
    • 1283

    #2
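    To count mapped reads, filter on the FLAG field rather than on RNAME:
    Code:
    samtools view -F 4 myBam.bam | wc -l
    Here -F 4 skips every read with the unmapped bit (0x4) set; -f 4 would do the opposite and count only the unmapped reads. See the FLAG field in the SAM spec.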
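
A likely reason the RNAME test overcounts: the SAM spec recommends that an unmapped read whose mate is mapped be given the mate's RNAME and POS, so column 3 can be non-'*' even when bit 0x4 is set. The sketch below illustrates this on a made-up four-record SAM fragment (the read names, positions and the sam_demo and count_mapped helpers are hypothetical, not taken from the poster's data). It tests bit 0x4 of column 2 with plain awk arithmetic, so it runs without samtools.

```shell
# Count mapped records by testing bit 0x4 of FLAG (column 2)
# instead of looking at RNAME (column 3). Header lines (@...) are skipped.
count_mapped() {
    awk -F '\t' '!/^@/ && int($2 / 4) % 2 == 0 { n++ } END { print n + 0 }'
}

# Hypothetical SAM fragment: pair r1 is fully mapped; in pair r2 the
# second read is unmapped (FLAG 133) but carries its mate's RNAME and POS.
sam_demo() {
    printf 'r1\t99\tchr1\t100\t60\t4M\t=\t200\t104\tACGT\tIIII\n'
    printf 'r1\t147\tchr1\t200\t60\t4M\t=\t100\t-104\tACGT\tIIII\n'
    printf 'r2\t73\tchr1\t300\t60\t4M\t=\t300\t0\tACGT\tIIII\n'
    printf 'r2\t133\tchr1\t300\t0\t*\t=\t300\t0\tACGT\tIIII\n'
}

mapped_by_flag=$(sam_demo | count_mapped)
mapped_by_rname=$(sam_demo | awk -F '\t' '($3 != "*")' | wc -l | tr -d ' ')
echo "by FLAG: $mapped_by_flag, by RNAME: $mapped_by_rname"
```

On a real file the same helper could be fed with samtools view myBam.bam | count_mapped, or the count can be taken directly with samtools view -c -F 4 myBam.bam.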


    • bioinfosm
      Senior Member
      • Jan 2008
      • 483

      #3
      When the flag field is '*', does it mean an unmapped read? I know that a flag 4 is definitely unmapped.
      --
      bioinfosm


      • nilshomer
        Nils Homer
        • Nov 2008
        • 1283

        #4
        The '*' in the flag field is invalid, and does not conform to the specification (see the "Note" in Section 2.2.1).
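
To make the point concrete: FLAG is an integer bit field, so a value such as '*' cannot be decoded at all. The following is a minimal sketch of a decoder for a few of the bits defined in the SAM spec; the decode_flag name and its output labels are made up for illustration, and only four bits are covered.

```shell
# Decode a few SAM FLAG bits. FLAG must be a non-negative integer,
# so anything else (including '*') is rejected.
decode_flag() {
    flag=$1
    case $flag in
        ''|*[!0-9]*) echo "invalid FLAG: $flag"; return 1 ;;
    esac
    out=""
    [ $(( flag & 0x1 )) -ne 0 ] && out="$out paired"          # read is part of a pair
    [ $(( flag & 0x4 )) -ne 0 ] && out="$out unmapped"        # this read is unmapped
    [ $(( flag & 0x8 )) -ne 0 ] && out="$out mate_unmapped"   # its mate is unmapped
    [ $(( flag & 0x10 )) -ne 0 ] && out="$out reverse"        # aligned to reverse strand
    echo "${out# }"
}

decode_flag 4
decode_flag 133
decode_flag '*'
```

A read counts as unmapped whenever the 0x4 bit is set, whatever the other bits are, which is why a FLAG of 133 is unmapped just like a FLAG of 4.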
