Maybe overlooking something but ...
when I use tophat with paired reads having a name as
I end up with the second part clipped and a arbitrary read number put instead in the resulting bam as SRR479052.5415964 which does not support marking optical duplicates with picard
Q: Can I preserve the full read address when using tophat with some magic argument? or should I parse both fastQ and bam to reconstitute this missing info?
Thanks for help
Stephane
when I use tophat with paired reads having a name as
@SRR479052.1 HWI-ST188:1:1101:1222:2140/1
Code:
READ_NAME_REGEX=[a-zA-Z0-9]+:[0-9]:([0-9]+):([0-9]+):([0-9]+).*.
Thanks for help
Stephane
Comment