Seqanswers Leaderboard Ad

Collapse

Announcement

Collapse
No announcement yet.
X
 
  • Filter
  • Time
  • Show
Clear All
new posts

  • BWA - Why giving only one match per read ?

    I am doing experiments with BWA and BowTie. Now, I am finding alignments using command:


    ./bwa aln -n 0 -k 41 database.fa SRR4493095_1.fastq > aln_sa.sai

    and

    ./bwa samse database .fa aln_sa.sai SRR4493095_1.fastq > out_sa.sam

    However, while BowTie gives 123 number of matches for 100 reads, BWA gives just 100 number of matches (one for each, does not reporting if match in another position found). Can anybody help me why is this happening and how to solve this ? I give -n 0 option because, I want to find matching allowing no mismatch.

    Thanks in advance.

  • #2
    Originally posted by Arupsss View Post
    I am doing experiments with BWA and BowTie.
    Hi, could you please post your bowtie command, as well, for completeness?
    Why did you set -k to 41?
    How long are your reads?

    Comment


    • #3
      Originally posted by sdvie View Post
      Hi, could you please post your bowtie command, as well, for completeness?
      Here it is:

      ./bowtie -a -v 0 database SRR4493095_1.fastq out.txt

      Comment


      • #4
        Are you well aware of the options you are using in bwa?
        I am highlighting the ones that you are using or that you might consider using (from bwa 0.6.1):

        Code:
        bwa aln [options] <prefix> <in.fq>
        Options: [B]-n NUM    max #diff (int) or missing prob under 0.02 err rate (float) [0.04][/B]
                 -o INT    maximum number or fraction of gap opens [1]
                 -e INT    maximum number of gap extensions, -1 for disabling long gaps [-1]
                 -i INT    do not put an indel within INT bp towards the ends [5]
                 -d INT    maximum occurrences for extending a long deletion [10]
                 [B]-l INT    seed length [32][/B]
                 [B]-k INT    maximum differences in the seed [2][/B]
                 -m INT    maximum entries in the queue [2000000]
                 -t INT    number of threads [1]
                 -M INT    mismatch penalty [3]
                 -O INT    gap open penalty [11]
                 -E INT    gap extension penalty [4]
                 -R INT    stop searching when there are >INT equally best hits [30]
                 -q INT    quality threshold for read trimming down to 35bp [0]
                 -f FILE   file to write output to instead of stdout
                 -B INT    length of barcode
                 -c        input sequences are in the color space
                 -L        log-scaled gap penalty for long deletions
                 [B]-N        non-iterative mode: search for all n-difference hits (slooow)[/B]
                 -I        the input is in the Illumina 1.3+ FASTQ-like format
                 -b        the input read file is in the BAM format
                 -0        use single-end reads only (effective with -b)
                 -1        use the 1st read in a pair (effective with -b)
                 -2        use the 2nd read in a pair (effective with -b)
        hope that helps,
        cheers,
        Sophia

        Comment


        • #5
          Thanks. But, from the above,you mean to use N option ? Because, I use only n and k options, not others.

          Comment


          • #6
            Originally posted by Arupsss View Post
            Thanks. But, from the above,you mean to use N option ? Because, I use only n and k options, not others.
            yes.
            And -k indicates the number of mismatches in the seed, and therefore, 41 is an unusual value for that. Why did you set it to 41?

            see here:
            BWA manual

            cheers

            Comment


            • #7
              Originally posted by Arupsss View Post
              I am doing experiments with BWA and BowTie. Now, I am finding alignments using command:


              ./bwa aln -n 0 -k 41 database.fa SRR4493095_1.fastq > aln_sa.sai

              and

              ./bwa samse database .fa aln_sa.sai SRR4493095_1.fastq > out_sa.sam

              However, while BowTie gives 123 number of matches for 100 reads, BWA gives just 100 number of matches (one for each, does not reporting if match in another position found). Can anybody help me why is this happening and how to solve this ? I give -n 0 option because, I want to find matching allowing no mismatch.

              Thanks in advance.
              That's how bwa works. It only returns on position when a read maps multiple times, it just picks one randomly. However, one of the tags in the .sam entry will contain the other positions where the read mapped equally well.

              Comment

              Latest Articles

              Collapse

              • seqadmin
                Exploring the Dynamics of the Tumor Microenvironment
                by seqadmin




                The complexity of cancer is clearly demonstrated in the diverse ecosystem of the tumor microenvironment (TME). The TME is made up of numerous cell types and its development begins with the changes that happen during oncogenesis. “Genomic mutations, copy number changes, epigenetic alterations, and alternative gene expression occur to varying degrees within the affected tumor cells,” explained Andrea O’Hara, Ph.D., Strategic Technical Specialist at Azenta. “As...
                07-08-2024, 03:19 PM
              • seqadmin
                Exploring Human Diversity Through Large-Scale Omics
                by seqadmin


                In 2003, researchers from the Human Genome Project (HGP) announced the most comprehensive genome to date1. Although the genome wasn’t fully completed until nearly 20 years later2, numerous large-scale projects, such as the International HapMap Project and 1000 Genomes Project, continued the HGP's work, capturing extensive variation and genomic diversity within humans. Recently, newer initiatives have significantly increased in scale and expanded beyond genomics, offering a more detailed...
                06-25-2024, 06:43 AM

              ad_right_rmr

              Collapse

              News

              Collapse

              Topics Statistics Last Post
              Started by seqadmin, 07-10-2024, 07:30 AM
              0 responses
              25 views
              0 likes
              Last Post seqadmin  
              Started by seqadmin, 07-03-2024, 09:45 AM
              0 responses
              201 views
              0 likes
              Last Post seqadmin  
              Started by seqadmin, 07-03-2024, 08:54 AM
              0 responses
              211 views
              0 likes
              Last Post seqadmin  
              Started by seqadmin, 07-02-2024, 03:00 PM
              0 responses
              193 views
              0 likes
              Last Post seqadmin  
              Working...
              X