Seqanswers Leaderboard Ad

Collapse

Announcement

Collapse
No announcement yet.
X
 
  • Filter
  • Time
  • Show
Clear All
new posts

  • bowtie2 fragment length

    Dear Bowtie2 user,

    I have a question regarding the fragment length issue.

    The reads that mapped to the different chromosomes sometimes have a fragment length of 0 but sometimes non-0. From the bowtie2 manuel
    I understand that the fragment length has to be non-0 if the mates mapped to the same chromosome.

    "Inferred fragment length. Size is negative if the mate's alignment occurs upstream of this alignment. Size is 0 if the mates did not align concordantly. However, size is non-0 if the mates aligned discordantly to the same chromosome."

    Here is an example from my bowtie2 output:
    Example for 0 fragment length

    MISEQ:52:000000000-A4DCH:1:1101:16444:3647 65 chr9 71742006 24 22M = 3007124 0 TTAATGTCTGTCTCCCTTACTG FFBFFFFGGGGFGGGGGHHHHG AS:i:-5 XN:i:0 XM:i:1 XO:i:0 XG:i:0 NM:i:1 MD:Z:21A0 YT:Z:UP
    MISEQ:52:000000000-A4DCH:1:1101:16444:3647 129 chr9 3007124 0 29M = 71742006 0 TCGTCATTTTTCAAGTCGTCAAGGGGATG ABBBBBFFFFFFGGGGGGGGGGCHFEEGG AS:i:-5 XN:i:0 XM:i:1 XO:i:0 XG:i:0 NM:i:1 MD:Z:23T5 YT:Z:UP

    Example for non-0 fragment length

    MISEQ:52:000000000-A4DCH:1:1101:17281:1944 65 chr14 54510108 42 23M = 52820128 -1690003 TTAAAGGTAAAAACCACCAACAT BFAFFFF1BGGGGFEF1AGFEGH AS:i:0 XN:i:0 XM:i:0 XO:i:0 XG:i:0 NM:i:0 MD:Z:23 YS:i:0 YT:Z: DP
    MISEQ:52:000000000-A4DCH:1:1101:17281:1944 129 chr14 52820128 42 31M = 54510108 1690003 CCTTGGAAGCCCACTGGAAAAAATGACAGAT AAABAFFFFCFBGGGGGGGGGGGHHHHGHFG AS:i:0 XN:i:0 XM:i:0 XO:i:0 XG:i:0 NM:i:0 MD:Z:31 YS:i:0 YT:Z: DP

    Do I miss something or is it a bowtie2 bug?

    Thanks in advance,

    Mary

  • #2
    Well both examples are discordant within the same chromosome (chr9 for the first example, chr14 for the second). However in your first example the MAPQ of the second read is 0. This may cause bowtie2 to not want to infer a TLEN (fragment size) thus setting it to 0.

    Comment

    Latest Articles

    Collapse

    • seqadmin
      Current Approaches to Protein Sequencing
      by seqadmin


      Proteins are often described as the workhorses of the cell, and identifying their sequences is key to understanding their role in biological processes and disease. Currently, the most common technique used to determine protein sequences is mass spectrometry. While still a valuable tool, mass spectrometry faces several limitations and requires a highly experienced scientist familiar with the equipment to operate it. Additionally, other proteomic methods, like affinity assays, are constrained...
      04-04-2024, 04:25 PM
    • seqadmin
      Strategies for Sequencing Challenging Samples
      by seqadmin


      Despite advancements in sequencing platforms and related sample preparation technologies, certain sample types continue to present significant challenges that can compromise sequencing results. Pedro Echave, Senior Manager of the Global Business Segment at Revvity, explained that the success of a sequencing experiment ultimately depends on the amount and integrity of the nucleic acid template (RNA or DNA) obtained from a sample. “The better the quality of the nucleic acid isolated...
      03-22-2024, 06:39 AM

    ad_right_rmr

    Collapse

    News

    Collapse

    Topics Statistics Last Post
    Started by seqadmin, 04-11-2024, 12:08 PM
    0 responses
    30 views
    0 likes
    Last Post seqadmin  
    Started by seqadmin, 04-10-2024, 10:19 PM
    0 responses
    32 views
    0 likes
    Last Post seqadmin  
    Started by seqadmin, 04-10-2024, 09:21 AM
    0 responses
    28 views
    0 likes
    Last Post seqadmin  
    Started by seqadmin, 04-04-2024, 09:00 AM
    0 responses
    52 views
    0 likes
    Last Post seqadmin  
    Working...
    X