Seqanswers Leaderboard Ad

Collapse
X
 
  • Filter
  • Time
  • Show
Clear All
new posts
  • M.Askari
    Junior Member
    • Mar 2024
    • 1

    Why Is My Uniquely Mapped Read Count in Samtools Higher Than in Bowtie2?

    Hi everyone,

    I aligned my paired-end sequencing data using Bowtie2 and processed the resulting BAM files with Picard MarkDuplicates. However, I noticed a discrepancy between Bowtie2’s uniquely mapped read count and Samtools’ uniquely mapped reads after filtering. Bowtie2 alignment statistics (bowtie2.txt):


    8536975 reads; of these: 5180282 (60.68%) aligned concordantly 0 times (unmapped) 1868933 (21.89%) aligned concordantly exactly 1 time (uniquely mapped) 1487760 (17.43%) aligned concordantly >1 times (multi-mapped) Overall alignment rate: 39.32%
    • Uniquely mapped reads reported by Bowtie2: 1,868,933
    Samtools read count after removing duplicates (samtools view -c -q 30 CR05NFYA_S1_mdu.bam):

    2,877,386
    • Samtools reports ~1M more uniquely mapped reads than Bowtie2.
    Questions:
    1. Why does my final _mdu.bam file (uniquely mapped, deduplicated) contain ~1M more reads than Bowtie2’s unique count?
    2. Does Bowtie2 apply stricter filtering than samtools view -q 30 when classifying uniquely mapped reads?
    3. Could multi-mapped reads or soft-clipped alignments still be counted in _mdu.bam, even with MAPQ ≥ 30?
    4. How does paired-end read counting differ between Bowtie2 and Samtools? Does Bowtie2 exclude some properly paired reads?
    What I Have Tried:
    • Verified read counts before and after duplicate marking:bash
      CopyEdit
      samtools view -c -F 1024 CR05NFYA_S1_mdu.bam
    • Checked MAPQ distribution:bash
      CopyEdit
      samtools view CR05NFYA_S1_mdu.bam | awk '{print $5}' | sort -n | uniq -c
    • Excluded secondary alignments:bash
      CopyEdit
      samtools view -c -F 256 CR05NFYA_S1_mdu.bam

    I would appreciate any insights!

    Thanks in advance!
  • fchatonnet
    Member
    • Sep 2014
    • 23

    #2
    Dear M. Askari, bowtie2 indicates the number of pairs of reads which are correctly aligned exactly once, so the number of uniquely mapped reads after bowtie2 usage is 3,737,866 (1,868,933​ * 2).
    After removing the duplicates, samtools view -c counts the single reads in the bam file, not the pairs and you end up with 2,877,386 reads, meaning that duplicate filtering eliminated 860,480 duplicate reads.
    Cheers and good luck!

    Comment

    Latest Articles

    Collapse

    • seqadmin
      Pathogen Surveillance with Advanced Genomic Tools
      by seqadmin




      The COVID-19 pandemic highlighted the need for proactive pathogen surveillance systems. As ongoing threats like avian influenza and newly emerging infections continue to pose risks, researchers are working to improve how quickly and accurately pathogens can be identified and tracked. In a recent SEQanswers webinar, two experts discussed how next-generation sequencing (NGS) and machine learning are shaping efforts to monitor viral variation and trace the origins of infectious...
      Yesterday, 11:48 AM
    • seqadmin
      New Genomics Tools and Methods Shared at AGBT 2025
      by seqadmin


      This year’s Advances in Genome Biology and Technology (AGBT) General Meeting commemorated the 25th anniversary of the event at its original venue on Marco Island, Florida. While this year’s event didn’t include high-profile musical performances, the industry announcements and cutting-edge research still drew the attention of leading scientists.

      The Headliner
      The biggest announcement was Roche stepping back into the sequencing platform market. In the years since...
      03-03-2025, 01:39 PM
    • seqadmin
      Investigating the Gut Microbiome Through Diet and Spatial Biology
      by seqadmin




      The human gut contains trillions of microorganisms that impact digestion, immune functions, and overall health1. Despite major breakthroughs, we’re only beginning to understand the full extent of the microbiome’s influence on health and disease. Advances in next-generation sequencing and spatial biology have opened new windows into this complex environment, yet many questions remain. This article highlights two recent studies exploring how diet influences microbial...
      02-24-2025, 06:31 AM

    ad_right_rmr

    Collapse

    News

    Collapse

    Topics Statistics Last Post
    Started by seqadmin, 03-20-2025, 05:03 AM
    0 responses
    26 views
    0 reactions
    Last Post seqadmin  
    Started by seqadmin, 03-19-2025, 07:27 AM
    0 responses
    33 views
    0 reactions
    Last Post seqadmin  
    Started by seqadmin, 03-18-2025, 12:50 PM
    0 responses
    25 views
    0 reactions
    Last Post seqadmin  
    Started by seqadmin, 03-03-2025, 01:15 PM
    0 responses
    190 views
    0 reactions
    Last Post seqadmin  
    Working...