I have Illumina pair-ended seq data. I try to remove duplicates from each DNA fragment.
For example, I have two pairs reads. A1 A2 and B1 B2. After Bowtie alignment, I have data as follows:
pair A ----------- chr ----------- strand ----------- position ----------- mate
A1 ----------- chr3 ----------- plus ------------- 37 ----------- 1
A2 ----------- chr3 ------------ minus ----------- 137 ----------- 2
pair B ----------- chr ----------- strand ----------- position ----------- mate
B2 ----------- chr3 ----------- plus --------------- 37 ----------- 2
B1 ----------- chr3 ------------ minus ----------- 137 ----------- 1
The only difference between pair A and pair B is the mate. Are pair A and pair B are duplicated? Thanks!
For example, I have two pairs reads. A1 A2 and B1 B2. After Bowtie alignment, I have data as follows:
pair A ----------- chr ----------- strand ----------- position ----------- mate
A1 ----------- chr3 ----------- plus ------------- 37 ----------- 1
A2 ----------- chr3 ------------ minus ----------- 137 ----------- 2
pair B ----------- chr ----------- strand ----------- position ----------- mate
B2 ----------- chr3 ----------- plus --------------- 37 ----------- 2
B1 ----------- chr3 ------------ minus ----------- 137 ----------- 1
The only difference between pair A and pair B is the mate. Are pair A and pair B are duplicated? Thanks!
Comment