Hello,
We have a sample containing several bacterial species and we want to uniquely map RNA-seq reads to the genomes of each of our organisms to get the expression patterns of each organism separately. We tried to use BWA (implemented in "Galaxy", poor GUI-users that we are) with the “edit distance” (aln -n in the command line version) set to 0 but none of the reads were mapped (all had the SAM tag set to “4’). This is an artifact since running BLAST with some of the sequences showed that they have 100% identity to one of our genomes and not any others, so they should map uniquely.
Does the "aln -n" option really determine the number of mismatches? Any ideas why BWA will not run well using –n=0?
Thanks
Daniel
We have a sample containing several bacterial species and we want to uniquely map RNA-seq reads to the genomes of each of our organisms to get the expression patterns of each organism separately. We tried to use BWA (implemented in "Galaxy", poor GUI-users that we are) with the “edit distance” (aln -n in the command line version) set to 0 but none of the reads were mapped (all had the SAM tag set to “4’). This is an artifact since running BLAST with some of the sequences showed that they have 100% identity to one of our genomes and not any others, so they should map uniquely.
Does the "aln -n" option really determine the number of mismatches? Any ideas why BWA will not run well using –n=0?
Thanks
Daniel
Comment