Seqanswers Leaderboard Ad

Collapse

Announcement

Collapse
No announcement yet.
X
 
  • Filter
  • Time
  • Show
Clear All
new posts

  • BWA - Why giving only one match per read ?

    I am doing experiments with BWA and BowTie. Now, I am finding alignments using command:


    ./bwa aln -n 0 -k 41 database.fa SRR4493095_1.fastq > aln_sa.sai

    and

    ./bwa samse database .fa aln_sa.sai SRR4493095_1.fastq > out_sa.sam

    However, while BowTie gives 123 number of matches for 100 reads, BWA gives just 100 number of matches (one for each, does not reporting if match in another position found). Can anybody help me why is this happening and how to solve this ? I give -n 0 option because, I want to find matching allowing no mismatch.

    Thanks in advance.

  • #2
    Originally posted by Arupsss View Post
    I am doing experiments with BWA and BowTie.
    Hi, could you please post your bowtie command, as well, for completeness?
    Why did you set -k to 41?
    How long are your reads?

    Comment


    • #3
      Originally posted by sdvie View Post
      Hi, could you please post your bowtie command, as well, for completeness?
      Here it is:

      ./bowtie -a -v 0 database SRR4493095_1.fastq out.txt

      Comment


      • #4
        Are you well aware of the options you are using in bwa?
        I am highlighting the ones that you are using or that you might consider using (from bwa 0.6.1):

        Code:
        bwa aln [options] <prefix> <in.fq>
        Options: [B]-n NUM    max #diff (int) or missing prob under 0.02 err rate (float) [0.04][/B]
                 -o INT    maximum number or fraction of gap opens [1]
                 -e INT    maximum number of gap extensions, -1 for disabling long gaps [-1]
                 -i INT    do not put an indel within INT bp towards the ends [5]
                 -d INT    maximum occurrences for extending a long deletion [10]
                 [B]-l INT    seed length [32][/B]
                 [B]-k INT    maximum differences in the seed [2][/B]
                 -m INT    maximum entries in the queue [2000000]
                 -t INT    number of threads [1]
                 -M INT    mismatch penalty [3]
                 -O INT    gap open penalty [11]
                 -E INT    gap extension penalty [4]
                 -R INT    stop searching when there are >INT equally best hits [30]
                 -q INT    quality threshold for read trimming down to 35bp [0]
                 -f FILE   file to write output to instead of stdout
                 -B INT    length of barcode
                 -c        input sequences are in the color space
                 -L        log-scaled gap penalty for long deletions
                 [B]-N        non-iterative mode: search for all n-difference hits (slooow)[/B]
                 -I        the input is in the Illumina 1.3+ FASTQ-like format
                 -b        the input read file is in the BAM format
                 -0        use single-end reads only (effective with -b)
                 -1        use the 1st read in a pair (effective with -b)
                 -2        use the 2nd read in a pair (effective with -b)
        hope that helps,
        cheers,
        Sophia

        Comment


        • #5
          Thanks. But, from the above,you mean to use N option ? Because, I use only n and k options, not others.

          Comment


          • #6
            Originally posted by Arupsss View Post
            Thanks. But, from the above,you mean to use N option ? Because, I use only n and k options, not others.
            yes.
            And -k indicates the number of mismatches in the seed, and therefore, 41 is an unusual value for that. Why did you set it to 41?

            see here:
            BWA manual

            cheers

            Comment


            • #7
              Originally posted by Arupsss View Post
              I am doing experiments with BWA and BowTie. Now, I am finding alignments using command:


              ./bwa aln -n 0 -k 41 database.fa SRR4493095_1.fastq > aln_sa.sai

              and

              ./bwa samse database .fa aln_sa.sai SRR4493095_1.fastq > out_sa.sam

              However, while BowTie gives 123 number of matches for 100 reads, BWA gives just 100 number of matches (one for each, does not reporting if match in another position found). Can anybody help me why is this happening and how to solve this ? I give -n 0 option because, I want to find matching allowing no mismatch.

              Thanks in advance.
              That's how bwa works. It only returns on position when a read maps multiple times, it just picks one randomly. However, one of the tags in the .sam entry will contain the other positions where the read mapped equally well.

              Comment

              Latest Articles

              Collapse

              • seqadmin
                Current Approaches to Protein Sequencing
                by seqadmin


                Proteins are often described as the workhorses of the cell, and identifying their sequences is key to understanding their role in biological processes and disease. Currently, the most common technique used to determine protein sequences is mass spectrometry. While still a valuable tool, mass spectrometry faces several limitations and requires a highly experienced scientist familiar with the equipment to operate it. Additionally, other proteomic methods, like affinity assays, are constrained...
                04-04-2024, 04:25 PM
              • seqadmin
                Strategies for Sequencing Challenging Samples
                by seqadmin


                Despite advancements in sequencing platforms and related sample preparation technologies, certain sample types continue to present significant challenges that can compromise sequencing results. Pedro Echave, Senior Manager of the Global Business Segment at Revvity, explained that the success of a sequencing experiment ultimately depends on the amount and integrity of the nucleic acid template (RNA or DNA) obtained from a sample. “The better the quality of the nucleic acid isolated...
                03-22-2024, 06:39 AM

              ad_right_rmr

              Collapse

              News

              Collapse

              Topics Statistics Last Post
              Started by seqadmin, 04-11-2024, 12:08 PM
              0 responses
              30 views
              0 likes
              Last Post seqadmin  
              Started by seqadmin, 04-10-2024, 10:19 PM
              0 responses
              32 views
              0 likes
              Last Post seqadmin  
              Started by seqadmin, 04-10-2024, 09:21 AM
              0 responses
              28 views
              0 likes
              Last Post seqadmin  
              Started by seqadmin, 04-04-2024, 09:00 AM
              0 responses
              52 views
              0 likes
              Last Post seqadmin  
              Working...
              X