  • Bowtie 2 parameters

    Hi All,

    I am using Bowtie 2 to align Illumina sample data to an influenza genome of less than 14k nt. Each sample has more than 10 million paired reads, and each read is 100 bp long.

    I initially ran Bowtie 2 with the default settings, without specifying any additional parameters. The percentage of read pairs that aligned correctly was between 35 and 50%, and the overall alignment rate (including reads that aligned singly) was between 38 and 55%.

    After reading the manual, I changed the parameters to the following command:

    bowtie2 -L 10 -N 1 -i S,1,0.20 --fr -x …….

    This means a seed length of 10, one mismatch allowed per seed, and a seed interval of 3 (1 + 0.20 * sqrt(100) = 1 + 0.20 * 10 for 100 bp reads).
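
The interval arithmetic can be checked with a small sketch. It assumes, as the Bowtie 2 manual describes for the `S` function type, that `-i S,a,b` means `a + b * sqrt(read length)`. The function name `seed_interval` is made up for illustration and is not part of Bowtie 2.

```python
import math


def seed_interval(read_length: int, constant: float, coefficient: float) -> float:
    """Interval between seeds for a setting of the form -i S,<constant>,<coefficient>.

    Assumes the square-root form: constant + coefficient * sqrt(read_length).
    """
    return constant + coefficient * math.sqrt(read_length)


# Setting from the post: -i S,1,0.20 with 100 bp reads.
custom_interval = seed_interval(100, 1.0, 0.20)
print(custom_interval)  # 3.0
```

A smaller interval means more seeds are extracted from each read, which makes the search more sensitive but also slower.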

    The alignment rate increased to between 40 and 75% for paired alignment and between 68 and 90% for overall alignment.

    The alignment results are considerably better, but on a closer look at the alignments I noticed that some reads aligned with more than 10 mismatches. We can even find some with 14, 18, or 21 mismatches.

    That makes me doubt my parameters and the quality of my alignment.

    I am new to bioinformatics and would appreciate your point of view on this issue.

    Many thanks
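
One way to see how common these high-mismatch alignments are is to tabulate mismatches per aligned read from the SAM output. The sketch below assumes the `XM:i` optional field that Bowtie 2 writes for the number of mismatches in an alignment. The function names and the three example records are invented for illustration and are not real data.

```python
from collections import Counter


def mismatch_count(sam_line):
    """Return the XM:i value of an aligned SAM record, or None if unaligned or untagged."""
    fields = sam_line.rstrip("\n").split("\t")
    if int(fields[1]) & 0x4:  # FLAG bit 0x4: read is unmapped
        return None
    for tag in fields[11:]:  # optional fields start at column 12
        if tag.startswith("XM:i:"):
            return int(tag[len("XM:i:"):])
    return None


def mismatch_histogram(sam_lines):
    """Count aligned reads by their number of mismatches, skipping header lines."""
    histogram = Counter()
    for line in sam_lines:
        if line.startswith("@"):
            continue
        count = mismatch_count(line)
        if count is not None:
            histogram[count] += 1
    return histogram


# Hypothetical records (sequence and quality columns left as "*").
example_lines = [
    "@HD\tVN:1.0\tSO:unsorted",
    "r1\t99\tseg1\t100\t42\t100M\t=\t300\t300\t*\t*\tAS:i:-5\tXM:i:1\tNM:i:1",
    "r2\t83\tseg1\t300\t1\t100M\t=\t100\t-300\t*\t*\tAS:i:-90\tXM:i:18\tNM:i:18",
    "r3\t77\t*\t0\t0\t*\t*\t0\t0\t*\t*\tYT:Z:UP",
]
histogram = mismatch_histogram(example_lines)
print(sorted(histogram.items()))  # [(1, 1), (18, 1)]
```

In practice the lines would come from the SAM file produced by the run, read line by line.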

  • #2
    Does the lower mapping rate make sense in light of the biology of the virus (very rapid evolution, etc.)?

    Also, does the research goal require that more than 38-55% of reads map? Nobody likes to throw away data, but in genomics/bioinformatics one has to accept this to a degree.

    Lastly, have you tried looking at the reads that are not mapped? Are they lower quality reads? Have you tried to assemble the unmapped reads? Is contamination possible?
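
The question about read quality can be checked roughly by comparing the mean base quality of mapped and unmapped reads in the SAM output. This is a minimal sketch that assumes Phred+33 quality encoding and uses FLAG bit 0x4 to identify unmapped reads. The function names and the two example records are hypothetical.

```python
def mean_phred(quality_string, offset=33):
    """Mean Phred score of one read, assuming Phred+33 encoding."""
    return sum(ord(char) - offset for char in quality_string) / len(quality_string)


def mean_quality_by_status(sam_lines):
    """Average the per-read mean quality separately for mapped and unmapped reads."""
    totals = {"mapped": [0.0, 0], "unmapped": [0.0, 0]}
    for line in sam_lines:
        if line.startswith("@"):  # skip header lines
            continue
        fields = line.rstrip("\n").split("\t")
        quality = fields[10]  # QUAL is column 11
        if quality == "*":  # quality not stored for this record
            continue
        status = "unmapped" if int(fields[1]) & 0x4 else "mapped"
        totals[status][0] += mean_phred(quality)
        totals[status][1] += 1
    return {
        status: (total / count if count else None)
        for status, (total, count) in totals.items()
    }


# Hypothetical 4 bp records: one mapped with high quality, one unmapped with low quality.
example_records = [
    "r1\t0\tseg1\t1\t42\t4M\t*\t0\t0\tACGT\tIIII",
    "r2\t4\t*\t0\t0\t*\t*\t0\t0\tACGT\t####",
]
print(mean_quality_by_status(example_records))  # {'mapped': 40.0, 'unmapped': 2.0}
```

If the unmapped reads are not clearly lower in quality, assembling them or searching them against other references, as suggested above, would be the next thing to try.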

    • #3
      I think it's normal to have some reads with more than 10 mismatches. With your parameters the maximum is about 30 mismatches (100/3),
      because you can have up to 1 mismatch per seed of 10. You should try N=0; the position of a read will be more accurate.
      VB
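
The 100/3 estimate can be made concrete by listing where seeds would start along a read. The sketch assumes seeds are taken at offsets 0, interval, 2 * interval, and so on, for as long as a full-length seed still fits in the read, which matches the worked example in the Bowtie 2 manual. The helper `seed_offsets` is hypothetical.

```python
def seed_offsets(read_length, seed_length, interval):
    """Start positions of seeds along one strand of a read.

    Assumes a seed is extracted every `interval` bases while a full seed still fits.
    """
    return list(range(0, read_length - seed_length + 1, interval))


# Settings from the thread: 100 bp reads, -L 10, interval 3.
offsets = seed_offsets(100, 10, 3)
print(len(offsets))  # 31
print(offsets[:4], offsets[-1])  # [0, 3, 6, 9] 90
```

With an interval of 3 and a seed length of 10, neighbouring seeds overlap, so a single mismatch can fall inside several seeds at once. As far as I understand the manual, -N and -L only control the seed search, and whether the extended alignment is reported depends on the minimum-score threshold (--score-min), so this count illustrates the estimate above rather than giving a hard limit on mismatches.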
