  • bwa sampe: proper pair but on different contigs!!??!!

    Dear all,


    Does anyone have an idea how the following is possible:
    I have reads flagged as mapped in a proper pair (as indicated by the SAM flag), yet the two mates map to different contigs!

    HWUSI-EAS300R:7:1:15:1404#0 147 FW_DM_LINE_Jockey 128 29 74M FW3_DM_LINE_Jockey 3131 0 TGCAAGATCGCTTAAATACATAGTGAATTGTTATCTTAAATAATAAAACTATGAGTCAGAATGACACTCGCGCC Y^S[]^\[]a_XSZ[_]]_`_`]```_^a^`^`[aa__`]V]```aa\a_`]aaaaaaaa`Ta\a`aaaba`aa XT:A:U NM:i:0 SM:i:29 AM:i:29 X0:i:1 X1:i:0 XM:i:0 XO:i:0 XG:i:0 MD:Z:74
    HWUSI-EAS300R:7:1:15:1495#0 147 Gypsy4_LTR_LTR_Gypsy 112 60 74M Gypsy4_I_LTR_Gypsy 6216 0 CATTCCACTGCCCGGAGCGTGTGAAGCGCAATGTCAGCATTCTGCCGTGAGCGCTGCTTCAAAAGACGGGCTAC XUPM^NHLSMW\SWSPM\MW]PW\TZ\aPMP^MS^S]]Z^M_^X]^Z^]Z^]`a]^Z_\aaS]Z`Sa]a`_a\a XT:A:U NM:i:3 XN:i:1 SM:i:37 AM:i:37 X0:i:1 X1:i:0 XM:i:3 XO:i:0 XG:i:0 MD:Z:5T32C22G12
    HWUSI-EAS300R:7:1:22:1504#0 147 FW_DM_LINE_Jockey 85 29 74M FW3_DM_LINE_Jockey 3125 0 AACTAAATAAAAAATCTGAAAGCGAAAGAGACGCTCTATGCGATGCAAGATCGCTTAAATACATAGTGAATTGT ]N^I_^WG[[[_YNFQP[XGM\_^^S\a__^``_Y[a^\_a_```aaa`a]a`a````ba_baa`a_bbaabaa XT:A:U NM:i:0 SM:i:29 AM:i:29 X0:i:1 X1:i:0 XM:i:0 XO:i:0 XG:i:0 MD:Z:74
    HWUSI-EAS300R:7:1:25:1975#0 83 BLOOD_I_LTR_Gypsy 145 29 13M3D61M BLASTOPIA_LTR_LTR_Gypsy 271 0
    I hope someone can help with this!
    best ro
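
For reference, the flag values in the records above (147 and 83) can be decoded bit by bit. A minimal sketch in Python, using the bit definitions from the SAM specification; the names `SAM_FLAG_BITS` and `decode_flag` are made up here for illustration:

```python
# Bit meanings as defined in the SAM specification.
SAM_FLAG_BITS = {
    0x1: "paired",
    0x2: "proper_pair",
    0x4: "unmapped",
    0x8: "mate_unmapped",
    0x10: "reverse",
    0x20: "mate_reverse",
    0x40: "first_in_pair",
    0x80: "second_in_pair",
    0x100: "secondary",
    0x200: "qc_fail",
    0x400: "duplicate",
    0x800: "supplementary",
}


def decode_flag(flag):
    """Return the names of all bits set in a SAM FLAG value."""
    return [name for bit, name in SAM_FLAG_BITS.items() if flag & bit]


if __name__ == "__main__":
    for value in (147, 83):
        print(value, decode_flag(value))
```

Both values have the 0x2 bit set, which is why the records are reported as properly paired even though the mate reference name differs from the read's own reference name.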

  • #2
    Could be a bug in the mapping tool used. What tool and what version was it?



    • #3
      Mapper: bwa
      Version: 0.57
      Command: bwa aln -n 0.01 -o 2 -e 12 -d 12 -t 2 etc.



      • #4
        Is there any obvious link between the contigs, in particular are they subsequent entries in the FASTA reference file?



        • #5
          I was under the impression that BWA concatenates all the references together and aligns reads against that long string. Might it have something to do with that?



          • #6
            Yes, they are subsequent entries in the FASTA file! It is the insert of an LTR transposon followed by the LTR, i.e. these sequences are frequently found in exactly this order in the different species.
            This could be an explanation for the problem, then. If BWA concatenates the sequences and measures the distance between the mates on the concatenated string, then it finds that the distance is correct while ignoring the fact that a contig boundary is crossed, and thus sets the "mapped in a proper pair" flag.
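
To make the suspected mechanism concrete, here is a toy model in Python. It is only a sketch of the hypothesis discussed in this thread, not BWA's actual code: the contig names, contig lengths, and the 500 bp insert-size limit are all made-up assumptions, and the function names are hypothetical.

```python
from itertools import accumulate


def build_offsets(contigs):
    """Map each contig name to its start offset in the concatenated reference."""
    names = [name for name, _ in contigs]
    lengths = [length for _, length in contigs]
    starts = [0] + list(accumulate(lengths))[:-1]
    return dict(zip(names, starts))


def naive_proper_pair(offsets, fwd, rev, read_len, max_insert):
    """Distance check on concatenated coordinates only (the suspected behaviour).

    fwd and rev are (contig_name, 1-based position) tuples for the
    forward-strand mate and the reverse-strand mate.
    """
    fwd_global = offsets[fwd[0]] + fwd[1]
    rev_global = offsets[rev[0]] + rev[1]
    span = rev_global + read_len - fwd_global
    return 0 < span <= max_insert


def contig_aware_proper_pair(offsets, fwd, rev, read_len, max_insert):
    """Same distance check, but both mates must lie on the same contig."""
    same_contig = fwd[0] == rev[0]
    return same_contig and naive_proper_pair(offsets, fwd, rev, read_len, max_insert)


if __name__ == "__main__":
    # Hypothetical reference: an internal sequence followed directly by its LTR.
    contigs = [("internal", 3200), ("ltr", 400)]
    offsets = build_offsets(contigs)
    fwd_mate = ("internal", 3131)  # forward mate near the end of the first contig
    rev_mate = ("ltr", 128)        # reverse mate near the start of the next contig
    print(naive_proper_pair(offsets, fwd_mate, rev_mate, 74, 500))
    print(contig_aware_proper_pair(offsets, fwd_mate, rev_mate, 74, 500))
```

With these made-up numbers the mates are 271 bp apart on the concatenated string, so the naive check accepts the pair while the contig-aware check rejects it.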



            • #7
              Originally posted by GoneSouth
              Yes they are subsequent entries in the fasta file!
              Given Lee Sam's post, you can probably see why I asked that.

              I.e., this is probably a bug in BWA that wrongly marks the reads as "properly paired".



              • #8
                Yes I do, many thanks for all your help!!
                Now that I know what's going on, I can handle this in my SAM parser.
                And maybe the people from Sanger will find some time to fix this in one of the next versions. I will send a bug report.
                thanks ro
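
One way the parser-side workaround mentioned above might look, sketched in Python. This is an assumption about a reasonable fix, not the poster's actual parser: it clears the proper-pair bit (0x2) whenever the mate reference name (RNEXT, column 7) names a different contig than the read's own reference (RNAME, column 3). The function name is hypothetical, and the input is assumed to be a standard tab-separated SAM line.

```python
PROPER_PAIR = 0x2  # "each segment properly aligned" bit in the SAM FLAG field


def clear_cross_contig_proper_pair(sam_line):
    """Return the SAM line with the proper-pair bit cleared if the mate
    is reported on a different reference sequence.

    Header lines and lines with fewer than 11 fields are returned unchanged
    (apart from removal of a trailing newline).
    """
    line = sam_line.rstrip("\n")
    if line.startswith("@"):
        return line
    fields = line.split("\t")
    if len(fields) < 11:
        return line
    flag = int(fields[1])
    rname, rnext = fields[2], fields[6]
    # "=" means same reference as RNAME, "*" means no mate information.
    mate_elsewhere = rnext not in ("=", "*", rname)
    if flag & PROPER_PAIR and mate_elsewhere:
        fields[1] = str(flag & ~PROPER_PAIR)
    return "\t".join(fields)


if __name__ == "__main__":
    # Shortened, made-up record modelled on the ones posted above.
    record = "read1\t147\tcontigB\t128\t29\t4M\tcontigA\t3131\t0\tACGT\tIIII"
    print(clear_cross_contig_proper_pair(record))
```

Pairs whose mates are on the same reference are left untouched, so the filter only affects the records that cross a contig boundary.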
