Seqanswers Leaderboard Ad

Collapse

Announcement

Collapse
No announcement yet.
X
 
  • Filter
  • Time
  • Show
Clear All
new posts

  • Alignment program that allows the greatest number of mismatches

    I am trying to align reads allowing for as many mismatches as possible. I was using Novocraft before and that program allows me to align up to about 9 mismatches. Is there a program out there that can allow more?

    thanks

  • #2
    Stampy?

    As far as I know Stampy is not restricted to a specific number of mismatches...

    http://www.well.ox.ac.uk/project-stampy

    Stampy has the following features:

    - Maps single, paired-end, mate pair Illumina reads to a reference
    - Fast: about 10 (with BWA) or 15 hours (without) per Gbase
    - Low memory footprint: 2.7 Gb shared memory for a 3Gbase genome
    - High sensitivity for indels and divergent reads, up to 10-15%
    - Low mapping bias for reads with SNPs or indels
    - Well calibrated mapping quality scores
    - Input: Fastq and Fasta; gzipped or plain; SAM and BAM
    - Output: SAM, Maq's map file
    - Optionally calculates per-base alignment posteriors
    - Optionally processes part of the input
    - Handles reads up to 4500 bases

    To calculate correct mapping qualities, Stampy needs to know the
    expected divergence from the reference. This is set with the
    --substitutionrate= option. The default is 0.001 substitutions per
    site.

    Increasing the read length, and using paired-end reads, helps mapping
    divergent reads. The following table gives an indication of the
    divergence at which a reasonable proportion of reads can be correctly
    mapped. These numbers were obtained by simulation, using the human
    genome as reference, and should be taken as an indication only; they
    are dependent on error rates, the repetitiveness of the genome, the
    insert size distribution, and local variations in divergence; in
    addition no indel mutations were included.

    36bp 36bp 72bp 72bp
    divergence | single paired single paired
    -------------------------------------------------------
    0% | 82% 95% 87% 96%
    3% | 73% 91% 80% 94%
    6% | 60% 83% 72% 92%
    9% | 41% 56% 56% 88%
    12% | 28% 51% 48% 80%

    Comment


    • #3
      BFAST does a better job than most aligners (Figure 3 of the paper shows comparative analysis), although I don't believe Stampy was included.

      Comment


      • #4
        How long are the reads you want to align?
        ecSeq Bioinformatics is Europe’s leading provider of hands-on bioinformatics workshops and professional data analysis in the field of Next-Generation Sequencing (NGS).

        Comment

        Latest Articles

        Collapse

        • seqadmin
          Recent Advances in Sequencing Analysis Tools
          by seqadmin


          The sequencing world is rapidly changing due to declining costs, enhanced accuracies, and the advent of newer, cutting-edge instruments. Equally important to these developments are improvements in sequencing analysis, a process that converts vast amounts of raw data into a comprehensible and meaningful form. This complex task requires expertise and the right analysis tools. In this article, we highlight the progress and innovation in sequencing analysis by reviewing several of the...
          Yesterday, 07:48 AM
        • seqadmin
          Essential Discoveries and Tools in Epitranscriptomics
          by seqadmin




          The field of epigenetics has traditionally concentrated more on DNA and how changes like methylation and phosphorylation of histones impact gene expression and regulation. However, our increased understanding of RNA modifications and their importance in cellular processes has led to a rise in epitranscriptomics research. “Epitranscriptomics brings together the concepts of epigenetics and gene expression,” explained Adrien Leger, PhD, Principal Research Scientist...
          04-22-2024, 07:01 AM

        ad_right_rmr

        Collapse

        News

        Collapse

        Topics Statistics Last Post
        Started by seqadmin, Yesterday, 07:17 AM
        0 responses
        11 views
        0 likes
        Last Post seqadmin  
        Started by seqadmin, 05-02-2024, 08:06 AM
        0 responses
        19 views
        0 likes
        Last Post seqadmin  
        Started by seqadmin, 04-30-2024, 12:17 PM
        0 responses
        20 views
        0 likes
        Last Post seqadmin  
        Started by seqadmin, 04-29-2024, 10:49 AM
        0 responses
        29 views
        0 likes
        Last Post seqadmin  
        Working...
        X