  • Problem with BWA mapping of Illumina PE short insert size fragments (FFPE material)

    Hi,
    I am having difficulties mapping paired-end Illumina reads from FFPE material (i.e. short insert size, approximately 80-150 bp) with BWA (both version 0.5.9 and the newest version, 0.6.1).
    To enforce the short insert size the following flags were used:
    "$BWA sampe -a 500 -A -o 10000 -r …"

    Mapping completes without errors. However, when viewing the sorted.bam files, the paired reads are more often than not mapped to different chromosomes (as exemplified by the picture), even though the reads overlap each other because of the short insert size!
    The majority of the reads contained adapter sequence, which was removed with cutadapt. This suggests that the insert size was smaller than 100 bp. Is this what is causing the mapping problems, and how would I go about getting around it?

    Your reply would be much appreciated!

  • #2
    That doesn't seem too likely to me. I recently got some data where the insert sizes are a little smaller and more variable than I'd like, and bwa has no problem assigning them to the right coordinates, with lots of overlap where necessary. But then again, my samples are bacteria, and there's a lot less genome for the aligner to play with.

    3 Mb, isn't that pretty close to the telomere? Do you see the same problem in the middle of the chromosome?

    First thing I'd do is spot-check both sequences of the pairs in those cross-chromosome clusters. If you manually BLAST those sequences, can you get alignment positions that make more sense than what bwa assigned? The mapping quality seems to be high for those, so that's not the problem.
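
A minimal sketch of how the read pairs for such a spot check could be collected. It is not from the thread: the function name `cross_chromosome_pairs` and the `min_mapq` parameter are hypothetical, and the input is assumed to be SAM-formatted text lines (for example the output of `samtools view` on the sorted BAM). It relies only on the standard SAM columns and flag bits.

```python
def cross_chromosome_pairs(sam_lines, min_mapq=0):
    """Return (names, total) for pairs whose mates map to different references.

    names: read names of pairs split across references (candidates for BLAST).
    total: number of pairs with both mates mapped that were examined.
    Only the first read of each pair is inspected, so a pair is counted once.
    """
    names = []
    total = 0
    for line in sam_lines:
        if line.startswith("@"):          # header line
            continue
        fields = line.rstrip("\n").split("\t")
        if len(fields) < 11:              # not a complete SAM record
            continue
        flag = int(fields[1])
        # Keep paired reads (0x1) that are first in pair (0x40).
        if not flag & 0x1 or not flag & 0x40:
            continue
        # Skip unmapped reads/mates and secondary/supplementary records.
        if flag & (0x4 | 0x8 | 0x100 | 0x800):
            continue
        total += 1
        rname, mapq, rnext = fields[2], int(fields[4]), fields[6]
        # RNEXT is "=" when the mate is on the same reference, "*" if unknown.
        if rnext not in ("=", "*", rname) and mapq >= min_mapq:
            names.append(fields[0])
    return names, total
```

Dividing `len(names)` by `total` gives the fraction of pairs split across chromosomes, and the returned names can be used to pull out the sequences to BLAST by hand.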



    • #3
      Hello,

      I thought I'd give a late response with an update on what I did.

      BWA PE mapping was not successful: in 80% of all cases the paired reads mapped to different chromosomes, and the mapping quality of these was still high! This is probably due to the extremely short insert size of the FFPE material, which messes with the mapping algorithm.

      I tried BWA0.6.1 – same result.

      Adapter-trimmed sequences are too short to contain more information if mapped in pairs, whereas sequences longer than 100 bp contain additional information when mapped in pairs. So I wrote a Perl script that separates trimmed reads into one file (for SE mapping) and untrimmed read pairs into two files (for PE mapping). It worked fine.
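
The poster's Perl script is not shown in the thread. The following is a rough Python sketch of the same idea under stated assumptions: the two FASTQ files hold the same reads in the same order, every record is exactly four lines, and a pair counts as "trimmed" if either mate is shorter than the full read length. Sending both mates of a trimmed pair to the single-end set is also an assumption. The names `read_fastq`, `split_pairs` and `full_length` are hypothetical.

```python
from itertools import islice


def read_fastq(handle):
    """Yield (header, seq, plus, qual) tuples from 4-line FASTQ records."""
    handle = iter(handle)                 # accept files or lists of lines
    while True:
        record = [line.rstrip("\n") for line in islice(handle, 4)]
        if len(record) < 4:               # end of input or truncated record
            return
        yield tuple(record)


def split_pairs(reads1, reads2, full_length):
    """Split synchronized read pairs by whether adapter trimming shortened them.

    Returns (single, paired1, paired2):
      single  - both mates of every pair in which either mate was trimmed
      paired1 - first mates of pairs where both reads are full length
      paired2 - the matching second mates, in the same order
    """
    single, paired1, paired2 = [], [], []
    for r1, r2 in zip(reads1, reads2):
        if len(r1[1]) >= full_length and len(r2[1]) >= full_length:
            paired1.append(r1)
            paired2.append(r2)
        else:
            single.extend([r1, r2])
    return single, paired1, paired2
```

Writing each returned list back out as FASTQ (`"\n".join(record)` per read) gives one file for single-end mapping and two files for paired-end mapping.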
