Seqanswers Leaderboard Ad

Collapse

Announcement

Collapse
No announcement yet.
X
 
  • Filter
  • Time
  • Show
Clear All
new posts

  • How to Set Clipping Penalty in BWA ALN?

    Hi all!


    I'm using BWA aln (v 0.7.10) to map short reads (75bp - 100bp) to hg19 genome. Is it possible to adjust the penalty for soft-clipping? I want to set it lower than mismatch penalty in order to prevent mismatches at read ends and to favor soft-clipping.

    Alternatively, I could increase mismatch penalty, but what is the penalty for soft-clipping?


    Thanks for your help!
    Last edited by svos; 03-17-2015, 04:09 AM.

  • #2
    I'm not sure about bwa, but I do soft-clipping by trimming the ends to the point that the global score is maximized - therefore, the higher the mismatch penalty, the more soft-clipping will be done. Assuming that bwa operates in the same way, the amount of soft-clipping will be strictly dictated by the ratio of the match to mismatch (or gap, etc) scores. So to increase the amount of soft-clipping, just increase the mismatch penalty.

    Comment


    • #3
      At least for BWA mem algorithm this is not true. By default, it has a mismatch penalty of 4 and a clipping penalty of 5, so it rather introduces mismatches at the end of the reads instead of soft-slipping them.
      In BWA mem, these penalties can be set by the user, but no such parameter exists for BWA aln...

      Comment


      • #4
        In that case, why not use bwa-mem? It's supposed to be better for reads over 70bp, anyway. Do you find that aln gives better results?

        Comment


        • #5
          Yes, I see mapping artifacts that I can not explain when I use BWA mem! BWA mem is set out to be used with reads having a length >= 100bp, but I'd still rather use BWA aln, as I get more reliable results (with both, 75bp and 100bp reads).

          Comment

          Latest Articles

          Collapse

          • seqadmin
            Essential Discoveries and Tools in Epitranscriptomics
            by seqadmin




            The field of epigenetics has traditionally concentrated more on DNA and how changes like methylation and phosphorylation of histones impact gene expression and regulation. However, our increased understanding of RNA modifications and their importance in cellular processes has led to a rise in epitranscriptomics research. “Epitranscriptomics brings together the concepts of epigenetics and gene expression,” explained Adrien Leger, PhD, Principal Research Scientist...
            04-22-2024, 07:01 AM
          • seqadmin
            Current Approaches to Protein Sequencing
            by seqadmin


            Proteins are often described as the workhorses of the cell, and identifying their sequences is key to understanding their role in biological processes and disease. Currently, the most common technique used to determine protein sequences is mass spectrometry. While still a valuable tool, mass spectrometry faces several limitations and requires a highly experienced scientist familiar with the equipment to operate it. Additionally, other proteomic methods, like affinity assays, are constrained...
            04-04-2024, 04:25 PM

          ad_right_rmr

          Collapse

          News

          Collapse

          Topics Statistics Last Post
          Started by seqadmin, 04-25-2024, 11:49 AM
          0 responses
          20 views
          0 likes
          Last Post seqadmin  
          Started by seqadmin, 04-24-2024, 08:47 AM
          0 responses
          20 views
          0 likes
          Last Post seqadmin  
          Started by seqadmin, 04-11-2024, 12:08 PM
          0 responses
          62 views
          0 likes
          Last Post seqadmin  
          Started by seqadmin, 04-10-2024, 10:19 PM
          0 responses
          61 views
          0 likes
          Last Post seqadmin  
          Working...
          X