I'm attaching two plots from a sample with ~152 million reads, one truncated at a distance of 10000 and another going all the way to 50000. For what it's worth, I noticed that the Broad is using a distance of 2500 for patterned flow cell, which seems pretty reasonable. If one enables tile spanning, then you don't see saturation until ~20000, which seems a bit over the top.
The tile spanning results seem a bit over the top, though interestingly a distance of 1 is sufficient to find stuff with that enabled. I'll post the density of duplicates according to X/Y coordinate to ensure there's no NextSeq-like tile edge effect.
Update: Yup, no tile-edge effect, so not spanning tiles makes sense.
The tile spanning results seem a bit over the top, though interestingly a distance of 1 is sufficient to find stuff with that enabled. I'll post the density of duplicates according to X/Y coordinate to ensure there's no NextSeq-like tile edge effect.
Update: Yup, no tile-edge effect, so not spanning tiles makes sense.
Comment