Seqanswers Leaderboard Ad

**Brian Bushnell** · 11-11-2015, 10:26 PM

The length should be loosely coupled with the loading, since small molecules can outcompete large ones... or so I've been told. Looks like in your tests that's not the case. It probably depends strongly on your size distribution; maybe you don't have many small molecules.

However, I see no reason why the quality should be in any way related to the loading. Where did you hear that from?

Also - what kind of movie length are you running? If you're constrained by disposable costs rather than platform time, you can always run longer and generate a bit more data...

**zhiliangc** · 11-11-2015, 10:59 PM

Originally posted by Brian Bushnell View Post

The length should be loosely coupled with the loading, since small molecules can outcompete large ones... or so I've been told. Looks like in your tests that's not the case. It probably depends strongly on your size distribution; maybe you don't have many small molecules.

However, I see no reason why the quality should be in any way related to the loading. Where did you hear that from?

Also - what kind of movie length are you running? If you're constrained by disposable costs rather than platform time, you can always run longer and generate a bit more data...

Thanks for the reply.

When the machine arrived in our sequencing centre, PacBio gave us some bioinformatics training about sequence analysis. And we were told at that time the overloading will lead to more in-dels as the ZMWs might be clogged so the optimal loading would be at P1 of ~37%. However, we did here from some PacBio technicians that the optimal P1 would be 44% when they gave the wetlab training to our sequencing centre.

And yes we're not worrying much about machine time so the cost saving will be aiming for least number SMRT cells. We're already running 6h movies so I don't think we can go beyond that?

**rhall** · 11-12-2015, 12:21 PM

37% P1 vs. 45% is open for debate. 37% is the optimal value for P1 (minimal P0 and P2) given a perfect Poisson distribution, but the complexities of loading result in this rarely being a reality. The parameter to watch is P2, increasing percentage of P2 will result in lower quality data. An increase in yield with a significant increase in P2 is not a good approach for denovo assembly. For the plots that you show I would expect plotting P2 instead of P1 would be more informative. I'm guessing that the cells 7-15, which show a reduced N50 with higher P1 have higher P2 compared with the later high P1 cells that don't show any effect on N50.

**zhiliangc** · 11-12-2015, 04:09 PM

Originally posted by rhall View Post

37% P1 vs. 45% is open for debate. 37% is the optimal value for P1 (minimal P0 and P2) given a perfect Poisson distribution, but the complexities of loading result in this rarely being a reality. The parameter to watch is P2, increasing percentage of P2 will result in lower quality data. An increase in yield with a significant increase in P2 is not a good approach for denovo assembly. For the plots that you show I would expect plotting P2 instead of P1 would be more informative. I'm guessing that the cells 7-15, which show a reduced N50 with higher P1 have higher P2 compared with the later high P1 cells that don't show any effect on N50.

I've got updated plotting with both P1 and P2 now.
But judging from the plotting, P2 doesn't seem to correlate with read length or yield stats? I don't have a good reference to compare to so it's hard to check the sequencing quality at this stage. Just of curiosity, when you talk about "increasing percentage of P2 will result in lower quality data", what kind of quality measuring do you use? # of indels? # of erroneous bases?

Attached Files

**rhall** · 11-12-2015, 05:13 PM

Interesting, looking at the P2% it does not appear that it ever gets high enough to have an effect on N50 so it likely isn't having much of an effect in any of the runs on overall quality. I was probably over interpreting stochastic noise. If you can keep the P1 in the 40-55% range, with P2 ~10-15 as in that plot you shouldn't have any problems with data quality for denovo assembly.
The issue with P2 and data quality is two fold. Firstly the read length is reduced. P2 is a single number, but in reality a ZMW can go from P2->P1 in the process of a run, imagine a ZMWs has two polymerases, what happens is that in the process of the run one stops sequencing. You then start generating sequence from the remaining polymerse, but you don't have as long a time to sequence so the readlengths are shorter. Secondly the detection of multiple loading is not perfect, so it is possible to generate sequence from cross talking polymerases, this reduces accuracy across all error modes. Depending on the experiment this may not be too much of an issue, but in extreme situations it can be very deleterious. It mostly manifests in requiring higher coverage to generate the same quality consensus, or quiver convergence problems.

**zhiliangc** · 11-12-2015, 07:08 PM

Thanks for the comments.

Bringing P1 up to ~50% and keeping P2 at ~10-15 sounds like a good plan. In practise will be hard to hit it perfectly every time, but we’ll see what we can do.

Topics	Statistics	Last Post
Expanding the Horizons of Cellular Research with the Single Cell Atlas by seqadmin Started by seqadmin, Yesterday, 11:49 AM	0 responses 15 views 0 likes	Last Post by seqadmin Yesterday, 11:49 AM
Genetic Variants and Diabetes Risk in Childhood Cancer Survivors by seqadmin Started by seqadmin, 04-24-2024, 08:47 AM	0 responses 16 views 0 likes	Last Post by seqadmin 04-24-2024, 08:47 AM
Cancer Metastasis: A Deep Dive into Cellular Plasticity by seqadmin Started by seqadmin, 04-11-2024, 12:08 PM	0 responses 61 views 0 likes	Last Post by seqadmin 04-11-2024, 12:08 PM
Proteogenomic Profiles Offer New Clues in Prostate Cancer by seqadmin Started by seqadmin, 04-10-2024, 10:19 PM	0 responses 60 views 0 likes	Last Post by seqadmin 04-10-2024, 10:19 PM

Seqanswers Leaderboard Ad

Announcement

P1 value vs read quality

Comment

Comment

Comment

Comment

Comment

Comment

Latest Articles

ad_right_rmr

News