I have a protein sequence and would like to find regions within it, at least 11 amino acids long, that are unique when compared to a human protein database. Any leads on how to address this?
My current thinking is to run BLAST against the human protein database and find region(s) that have no similarity to any other sequence, but I am wondering if there are other, more principled ways.
Ultimately, the goal is to find a unique region so one can design an antibody that would be specific to the protein.
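To make the "unique region" criterion concrete, here is a minimal sketch of one possible alternative to BLAST: an exact k-mer scan. It is not BLAST and it only detects exact 11-mer matches, so near-identical stretches would still need a similarity search on top. The function names (`build_kmer_index`, `unique_regions`) and the idea of passing the human proteins as a plain dict of id to sequence are assumptions made for illustration; loading the actual database (for example from a FASTA file) is left out.

```python
from typing import Dict, List, Set, Tuple


def build_kmer_index(proteins: Dict[str, str], k: int) -> Set[str]:
    """Collect every k-mer that occurs in any background protein."""
    kmers: Set[str] = set()
    for seq in proteins.values():
        seq = seq.upper()
        for i in range(len(seq) - k + 1):
            kmers.add(seq[i:i + k])
    return kmers


def unique_regions(
    query: str, background: Dict[str, str], k: int = 11
) -> List[Tuple[int, int, str]]:
    """Return maximal stretches of the query in which no k-mer window
    occurs exactly in the background set.

    Each result is (start, end, subsequence) with a 0-based start and an
    exclusive end. Neighbouring regions may share up to k-1 residues.
    """
    query = query.upper()
    index = build_kmer_index(background, k)
    regions: List[Tuple[int, int, str]] = []
    run_start = None  # first window of the current run of unique windows
    n_windows = len(query) - k + 1

    for i in range(n_windows):
        is_unique = query[i:i + k] not in index
        if is_unique and run_start is None:
            run_start = i
        elif not is_unique and run_start is not None:
            # The run ended at window i-1, which covers residues up to i-1+k.
            end = i - 1 + k
            regions.append((run_start, end, query[run_start:end]))
            run_start = None

    if run_start is not None:
        # The run extends through the last window, i.e. to the end of the query.
        regions.append((run_start, len(query), query[run_start:]))
    return regions


if __name__ == "__main__":
    # Toy data with k=3 so the behaviour is easy to follow by eye.
    toy_background = {"bg1": "ACDEFGHIK"}
    for start, end, sub in unique_regions("WWWDEFYYY", toy_background, k=3):
        print(start, end, sub)
```

Any region returned with `k=11` is at least 11 residues long by construction. It would probably still be worth checking the candidates with BLAST afterwards, since an antibody may cross-react with stretches that are similar but not identical.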