Maybe there is a better place for a bug report, but I couldn't find a mailing list or bug tracker.
My understanding of Sequence trimming is that it looks for a match (allowing mismatches) between the left part of the read and the right part of the adapter. However, it seems to fail to find a match when the match is as long as the whole adapter.
Here are a few commands that reproduce the issue:
Code:
>>> from HTSeq import Sequence
>>> read=Sequence('ACACGTTCGATATCCCGTATGCAACGGACCCGGCAGGAAACCGGCTGTGGG')
>>> adapter1=Sequence('ACACGT')
>>> adapter2=Sequence('AACACGT')
>>> print read.seq.startswith(adapter1.seq)
True
>>> print read.seq.startswith(adapter2.seq)
False
>>> print read.trim_left_end(adapter1)
ACACGTTCGATATCCCGTATGCAACGGACCCGGCAGGAAACCGGCTGTGGG
>>> print read.trim_left_end(adapter2)
TCGATATCCCGTATGCAACGGACCCGGCAGGAAACCGGCTGTGGG
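For comparison, here is a minimal pure-Python sketch of the behaviour I would expect. It is not HTSeq's implementation; the function name `expected_trim_left_end` and the parameter `max_mismatch_fraction` are made up for illustration, and it works on plain strings rather than Sequence objects. The point is only that the overlap lengths tried should include the full adapter length.

```python
def expected_trim_left_end(read, adapter, max_mismatch_fraction=0.0):
    """Remove the longest prefix of `read` that matches a suffix of `adapter`.

    Hypothetical reference sketch, not HTSeq code. `max_mismatch_fraction`
    is an assumed parameter: the share of mismatching positions tolerated
    within the overlap.
    """
    best = 0
    # Try every overlap length, including len(adapter) itself, so that a
    # read starting with the complete adapter is also trimmed.
    for k in range(1, min(len(read), len(adapter)) + 1):
        adapter_suffix = adapter[-k:]
        read_prefix = read[:k]
        mismatches = sum(a != b for a, b in zip(adapter_suffix, read_prefix))
        if mismatches <= max_mismatch_fraction * k:
            best = k  # keep the longest acceptable overlap
    return read[best:]


read = 'ACACGTTCGATATCCCGTATGCAACGGACCCGGCAGGAAACCGGCTGTGGG'
trimmed_with_adapter1 = expected_trim_left_end(read, 'ACACGT')
trimmed_with_adapter2 = expected_trim_left_end(read, 'AACACGT')
```

With this sketch both adapters give the same trimmed read, because in both cases the longest overlap is the six bases ACACGT at the start of the read. That is the result I expected from trim_left_end(adapter1) as well.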