Hi,
While merging 16S pair-ended miseq data (2x250) i would like to disable ambiguous bases. BBmerge documentation states the following "When there is a mismatch, the base chosen is the one with the higher quality value, or N if they are equal". I have at most 1000 sequences in this case out 80000 that are assigned an "N" so i will like to remove those.
I´m just wondering if it is possible with bbmerge to discard reads that fall into this category so i don´t get ambiguous bases, neither i want to choose between one base or the other, simply discard merged reads with N.
Note that my R1 and R2 reads do not contain ambiguous bases as i have cleaned them before merging.
Thanks,
While merging 16S pair-ended miseq data (2x250) i would like to disable ambiguous bases. BBmerge documentation states the following "When there is a mismatch, the base chosen is the one with the higher quality value, or N if they are equal". I have at most 1000 sequences in this case out 80000 that are assigned an "N" so i will like to remove those.
I´m just wondering if it is possible with bbmerge to discard reads that fall into this category so i don´t get ambiguous bases, neither i want to choose between one base or the other, simply discard merged reads with N.
Note that my R1 and R2 reads do not contain ambiguous bases as i have cleaned them before merging.
Thanks,
Comment