  • Reads not mapped in proper pair: Bowtie output

    Hello,

    when I align paired end fastq reads mapping to two different chromosomes using BWA, I typically get this result:

    ILLUMINA-8B7A89:1:33:5159:13399#TAGCTTA#0 65 chr22 21854387 37 30M chr9 132720172 0 TGGCCCAACGATGGCGAGGGCGCCTTCCAT GEFGGGEBEEEEEEEEEAEECDC==BCCBA

    ILLUMINA-8B7A89:1:33:5159:13399#TAGCTTA#0 129 chr9 132720172 37 30M chr22 21854387 0 TCTGCTGAGCAGCGGGATCAATGGCAGCTT FBFDEAEGEGGGEGGGDF5GGGGGGGGBGD


    where BWA correctly reports the alignment and labels the read as 'not mapped in proper pair' (flag 2 off).
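For anyone who wants to check records like these programmatically, here is a minimal Python sketch, not part of either aligner. It splits a SAM line into its 11 mandatory fields and tests whether both mates are mapped but to different reference sequences. The names `SamRecord`, `parse_sam_line` and `is_interchromosomal` are made up for illustration, and the lines are split on any whitespace because the tabs were flattened when the records were pasted here.

```python
from typing import NamedTuple


class SamRecord(NamedTuple):
    """The 11 mandatory SAM fields (hypothetical helper type)."""
    qname: str
    flag: int
    rname: str
    pos: int
    mapq: int
    cigar: str
    rnext: str
    pnext: int
    tlen: int
    seq: str
    qual: str


def parse_sam_line(line: str) -> SamRecord:
    # Real SAM is tab-separated; split() on any whitespace also copes
    # with records pasted into a forum post.
    f = line.split()
    return SamRecord(f[0], int(f[1]), f[2], int(f[3]), int(f[4]),
                     f[5], f[6], int(f[7]), int(f[8]), f[9], f[10])


def is_interchromosomal(rec: SamRecord) -> bool:
    # 0x4 = read unmapped, 0x8 = mate unmapped: both must be clear.
    both_mapped = not (rec.flag & 0x4) and not (rec.flag & 0x8)
    # RNEXT is "=" for the same reference and "*" when unavailable.
    return both_mapped and rec.rnext not in ("=", "*") and rec.rnext != rec.rname


bwa_line = ("ILLUMINA-8B7A89:1:33:5159:13399#TAGCTTA#0 65 chr22 21854387 37 30M "
            "chr9 132720172 0 TGGCCCAACGATGGCGAGGGCGCCTTCCAT "
            "GEFGGGEBEEEEEEEEEAEECDC==BCCBA")
bowtie_line = ("ILLUMINA-8B7A89:1:33:5159:13399#TAGCTTA#0 77 * 0 0 * * 0 0 "
               "TGGCCCAACGATGGCGAGGGCGCCTTCCAT GEFGGGEBEEEEEEEEEAEECDC==BCCBA")

print(is_interchromosomal(parse_sam_line(bwa_line)))     # True
print(is_interchromosomal(parse_sam_line(bowtie_line)))  # False
```

The BWA record passes the check (chr22 versus chr9, both mates mapped), while the Bowtie record for the same read does not, because its unmapped bits are set.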

    When I try the same using Bowtie I get a completely different result, with the paired read flagged as unmapped:

    ILLUMINA-8B7A89:1:33:5159:13399#TAGCTTA#0 77 * 0 0 * * 0 0 TGGCCCAACGATGGCGAGGGCGCCTTCCAT GEFGGGEBEEEEEEEEEAEECDC==BCCBA

    ILLUMINA-8B7A89:1:33:5159:13399#TAGCTTA#0 141 * 0 0 * * 0 0 TCTGCTGAGCAGCGGGATCAATGGCAGCTT FBFDEAEGEGGGEGGGDF5GGGGGGGGBGD


    Notably, when I map the two fastq as single reads, Bowtie reports the correct mapping to Chr9 and 22. I tried adjusting the Bowtie parameters, with no luck. Of course, I could output the 'unmapped' reads and remap them as single reads, but this is more complex and highly inefficient. Is there any Bowtie setting that lets me correctly map 'flag-2-off' reads?


    Thank you in advance,
    Rocco

  • #2
    "Notably, when I map the two fastq as single reads, Bowtie reports the correct mapping to Chr9 and 22"

    This doesn't sound at all like a correct mapping, unless you have a chromosome-level genome modification. It doesn't surprise me that Bowtie flags these as unmapped, because it has a limit on how far apart it will search for the matching pair.



    • #3
      Hello Gringer,

      thanks for your reply. What I'd like to highlight here is the difference between 'Unmapped' (i.e. SAM flags 0x0004 and 0x0008) and 'Not mapped in proper pair' (0x0002 off). The two sequences are (or should be) mapped, so the unmapped flags should be off and the corresponding chromosomes and positions should be reported in the SAM output.
      As you correctly stated, the 'distance' between the two reads is definitely abnormal, so I would expect the 0x0002 flag to be off.
      If you look at the BWA output, it reports the paired read with a 65/129 flag, where:

      65:
      Read paired
      Read not mapped in proper pair ***
      Read mapped ***
      Mate mapped ***
      1st in pair

      129:
      Read paired
      Read not mapped in proper pair ***
      Read mapped ***
      Mate mapped ***
      2nd in pair

      Please note that the two sequences ARE mapped, although they are not in a proper pair.
      This sounds more logical to me. If you know a way to force Bowtie to report like this, please let me know!
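The flag breakdown above can be reproduced with a small decoder. This is only an illustrative sketch: the bit values follow the SAM specification, but the names `SAM_FLAG_BITS` and `decode_flag` are invented here.

```python
# Bit values from the SAM specification; the descriptions are paraphrased.
SAM_FLAG_BITS = [
    (0x1, "read paired"),
    (0x2, "read mapped in proper pair"),
    (0x4, "read unmapped"),
    (0x8, "mate unmapped"),
    (0x10, "read on reverse strand"),
    (0x20, "mate on reverse strand"),
    (0x40, "first in pair"),
    (0x80, "second in pair"),
    (0x100, "not primary alignment"),
    (0x200, "read fails quality checks"),
    (0x400, "PCR or optical duplicate"),
    (0x800, "supplementary alignment"),
]


def decode_flag(flag: int) -> list:
    """Return the descriptions of all bits that are set in a SAM flag."""
    return [name for bit, name in SAM_FLAG_BITS if flag & bit]


for flag in (65, 129, 77, 141):
    print(flag, decode_flag(flag))
```

For 65 and 129 (the BWA output) only the "paired" and "first/second in pair" bits are set, so neither mate is flagged as unmapped and the proper-pair bit is off. For 77 and 141 (the Bowtie output) the 0x4 and 0x8 bits are set as well, which is the difference this post is about.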


      Regards,
      Rocco



      • #4
        Well, I'm not sure of any workarounds in Bowtie, but I agree that this output is incorrect, and I've submitted a bug report.



        • #5
          I've just had a response for the bug report:
          Bowtie 1 is not smart enough to notice or report when mates from a
          paired-end read align non-concordantly. In Bowtie 1, either a pair aligns
          concordantly or it fails to align. Thus, I don't think the issue you raise
          comes up. Bowtie 2, however, is smart enough to try to align the mates
          separately, and it reports the correct SAM flags in that situation.
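Following that suggestion, a basic paired-end Bowtie 2 run could be assembled as in the sketch below. The index prefix and file names are placeholders rather than anything from this thread, and `bowtie2_paired_command` is a hypothetical helper. Only the standard `-x`, `-1`, `-2` and `-S` options are used.

```python
import shlex


def bowtie2_paired_command(index_prefix, mates1, mates2, out_sam):
    """Build (but do not run) a basic paired-end Bowtie 2 command line."""
    return ["bowtie2",
            "-x", index_prefix,  # basename of the Bowtie 2 index
            "-1", mates1,        # FASTQ file with mate 1 reads
            "-2", mates2,        # FASTQ file with mate 2 reads
            "-S", out_sam]       # SAM output file


# Placeholder names, assumed for illustration only.
cmd = bowtie2_paired_command("genome_index", "reads_1.fastq",
                             "reads_2.fastq", "aligned.sam")
print(shlex.join(cmd))
```

With Bowtie 2 installed, the list can be passed to `subprocess.run(cmd, check=True)`. To keep mates that align to different chromosomes in the output, the `--no-discordant` and `--no-mixed` options should presumably be left off, since they disable the discordant and separate-mate alignments described in the response.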
