Originally posted by pmiguel
View Post
Seqanswers Leaderboard Ad
Collapse
Announcement
Collapse
No announcement yet.
X
-
Originally posted by Brian Bushnell View PostThis is kind of tangential to NovaSeq, but...
I've suggested that we keep everything on ice whenever possible prior to sequencing, due to the fact that low temperatures retard any kind of activity and thus should inhibit adapter-swapping (which is a huge problem as we run a lot of highly-amplified single cells). But my explanations were too vague to be taken seriously, since I don't know the specifics of the reactions. I would love to have a very clear (and preferably lengthy, rather than concise) explanation of exactly why and when keeping pools on ice should prevent crosstalk, that I can copy and paste (attributing credit, if desired) to the people in charge of making libraries.
I think it is obvious that the longer you let a mixed batch of libraries sit around, and the higher the temperature, the more index-swapping will occur, regardless of the mechanism. But without citing a specific mechanism (and it does not really matter if it is the dominant one), nobody involved with library prep will pay attention to my concerns on the issue (meaning, no tests of ice vs no ice). All I really need is a real mechanism, which seems sufficiently important to cause a test to be run; once that occurs, I'll be satisfied, even if the results are negative and indicate that keeping pooled libraries at a high temperature for a long time seems to be optimal for preventing crosstalk. Not that I'll believe negative results unless I run the experiment myself, but at least I'll believe I did my best. I'll still report the results here.
Comment
-
Index hopping is the result of annealed oligo extension by ExAmp. I do not know the details of ExAmp but KAPA HiFi polymerase under stringent cycling condition is able to extend primers as long as the 3’ base and other 6 bases in the 10 base region of 3’ is complementary even though the rest of the oligo is not a match and just hangs off the template.
Left over adapter oligos, PCR primers, single-adapted and non-adapted fragments can act as oligo and result in index hopping, neutral and cluster forming fusion fragments, respectively. So presence of high concentration of oligos acting as primers and longer incubation of library pool will increase these artifacts. I also would expect to see more fusion with PCR-free libraries as the proportion of fragments without adapters in both end are higher in comparison to PCR amplified libraries.
Comment
-
There've been a few hypotheses that ExAmp is actually Recombinase Polymerase Amplification (RPA), developed by TwistDX.
Here's a Youtube video describing it
It makes sense to me. And is semi-described in one of Illumina's patents that James Hadfield reviewed on his blog.
Comment
-
Originally posted by Brian Bushnell View PostWe have several theories for what was driving this on HiSeq... the most plausible being something like, "library A had too many unincorporated adapters,
Originally posted by Brian Bushnell View Postlibrary B had too many adapter-free inserts, and after mixing them, library B adopted some of the free adapters from library A".
Remember, DNA can be converted back and forth from single-stranded to double-stranded without the intervention of any enzyme if the right temperature/salt/concentration is present. The hydrogen-bond-guided interactions between the bases of reverse-complementary strands of DNA are reversible under these conditions.
The process of breaking the phophodiester/ribose backbone requires much more energy. Joining DNA strands via their backbone pretty much requires an enzyme.
Originally posted by Brian Bushnell View PostWhich would indicate that it involves both the donor and recipient library. But I'm not sure if that mechanism is important for NovaSeq.
--
Phillip
Comment
-
Originally posted by nucacidhunter View PostIndex hopping is the result of annealed oligo extension by ExAmp. I do not know the details of ExAmp but KAPA HiFi polymerase under stringent cycling condition is able to extend primers as long as the 3’ base and other 6 bases in the 10 base region of 3’ is complementary even though the rest of the oligo is not a match and just hangs off the template.
Left over adapter oligos, PCR primers, single-adapted and non-adapted fragments can act as oligo and result in index hopping, neutral and cluster forming fusion fragments, respectively. So presence of high concentration of oligos acting as primers and longer incubation of library pool will increase these artifacts. I also would expect to see more fusion with PCR-free libraries as the proportion of fragments without adapters in both end are higher in comparison to PCR amplified libraries.
I'm not really following why we need to posit either low stringency annealing event nor actual ligations (as the mechanism described in Brian's post would require) to explain index hopping. If there is any amplification occurring anywhere but tethered to the surface of the flowcell, then unincorporated adapter oligos could anneal and be extended, creating a "cross-over event" that would generate an index hopped library molecule. If that molecule seeded a cluster, then we would have an index hop.
--
Phillip
Comment
-
Illumina’s white paper on index hopping https://www.illumina.com/content/dam...inkId=36607862 shows that adding adapters not used in library prep increases index hopping with increased spike in of adapters. These adapters will be dissociated to single stranded oligos during denaturing. The oligos will be complementary to adapted library fragments in a maximum stretch of ~30 nt just before the adapter index sequences which indicates that index hopping can occur when relatively large overhang is present. I am not sure about how many bases need to anneal for an extension event but giving high processivity of ExAmp it might be a short stretch.
Chimerism will happen if the 3’ end of a fragment anneals to other fragments and is extended so fragments with adapters at both ends even with high similarity will not cause cause fusion. For my hypothesized mechanism then PCR-free libraries will be more prone to index hopping and chimerism. Indeed, Illumina data https://www.illumina.com/science/edu...x-hopping.html indicates higher index hopping for PCR-free libraries but they have not investigated chimerism events.
Index hopping is possible to happen on the flow cell tethered fragments but they would contribute if they seed another well on the flow cell. Wells with chimeras and multiple indices will have low quality sequences and more likely will be filtered in read processing steps.
Comment
-
Originally posted by nucacidhunter View PostIllumina’s white paper on index hopping https://www.illumina.com/content/dam...inkId=36607862 shows that adding adapters not used in library prep increases index hopping with increased spike in of adapters.
Originally posted by nucacidhunter View PostThese adapters will be dissociated to single stranded oligos during denaturing.
Originally posted by nucacidhunter View PostThe oligos will be complementary to adapted library fragments in a maximum stretch of ~30 nt just before the adapter index sequences which indicates that index hopping can occur when relatively large overhang is present.
Originally posted by nucacidhunter View PostI am not sure about how many bases need to anneal for an extension event but giving high processivity of ExAmp it might be a short stretch.
I don't doubt that many polymerases can extend from an oligo annealed over just a handful of bases. But an oligo annealed via a very short area of complementarity will do so with little stability unless the conditions of hybridization are such that they allow such this interaction. For example high salt concentrations can shield the negative phosphate backbone charges and thereby dampen that force which tends to tear the strands apart from one another.
Of course it is possible to lower the stringency of primer annealing of an amplification to allow just a few bases of homology to prime an extension event. But I can't think of any reason to do so during cluster formation -- it would allow various types of undesired mis-priming events that would be very undesirable. So I would doubt that Illumina would use such conditions.
--
Phillip
Comment
-
Maybe I am not using the correct terminology but by processivity I meant the polymerisation speed. For instance, some brands will extend a primer 1kb/min while others can do 3kb/min. Speedy polymerases specially with activity at suboptimal temperatures tend to extend less complimentary primers because the extension progresses before weakly bound unstable primers dissociates.
Comment
-
Large insert size on NovaSeq
Originally posted by Brian Bushnell View PostI did a comparison of duplicate rates on HiSeq2500 and NovaSeq, using Illumina's public data on BaseSpace:
NovaSeq seems to have a problem, but it's not clear why. These are not normal optical/well duplicates; they are extremely remote. It looks like during colony formation, some reads break off and reattach to an empty well somewhere else. The farthest-right point (at 25000) is not for distance 25000 but for distance infinity, including inter-tile duplicates.
These libraries are PCR-free WGS and thus should not really have more than a tiny fraction of duplicates, as seen on the HiSeq. Does anyone have any idea what's causing this? Does my hypothesis sound reasonable? Previous Illumina platforms had a very obvious distance cutoff where the number of duplicates increases rapidly up to a point, then plateaus (which is true for this HiSeq data, at around dist=45, but you can't see it in this graph). That is not the case for NovaSeq - it just keeps ascending, and there is no clear cutoff. It gradually bends, so there is no clear inflection point like there is on other platforms.
For reference, the libraries are both human NA12878 runs. NovaSeq is 2x150 and HiSeq 2500 is 2x100. Pairs are considered duplicates when the distance between colony centers is at most the stated distance, and both R1 and R2 match with some number of substitutions allowed, to account for sequencing error (8 for 150bp reads and 5 for 100bp reads). The insert sizes are quite large on average (>500bp) which reduces the rate of coincidental duplicates. HS2500 is ~10x and NovaSeq is ~30x coverage so the coincidental duplicate rate should be extremely low in both cases.
P.S. This is an underestimate of the duplicate rate for both platforms, as it was generated in a way that is not robust to sequencing error. I will regenerate the data, but it won't change the discrepancy, just the magnitude.
This is a bit off-topic, but your comment was the only thing I could find online about sequencing libraries with larger-than-recommended library fragment sizes on the NovaSeq. I prepared a TruSeq Nano library with unusually large fragment sizes (BioAnalyzer trace is attached), which I've done before and sequenced without issue on the MiSeq (seems to be a quirk of my setup as I follow Illumina protocol as closely as I can). This time I'd like to sequence on the NovaSeq, and I noticed this in the Illumina bulletin on migrating libraries between different sequencing platforms:
"Some applications with 550 bp or greater insert sizes are compatible with the NovaSeq platform, but additional optimization steps may be required."
Are you familiar with the optimization steps that they refer to? I am also in contact with Illumina technical support here in the UK regarding this and they are warning me that I should re-prep the library because I will get overclustering and rubbish results, but I'd rather not of course, especially considering that I successfully sequenced a library with nearly identical out-of-spec fragment size on the MiSeq. I imagine you might know a thing or two about this?Attached Files
Comment
Latest Articles
Collapse
-
by seqadmin
The field of immunogenetics explores how genetic variations influence immune responses and susceptibility to disease. In a recent SEQanswers webinar, Oscar Rodriguez, Ph.D., Postdoctoral Researcher at the University of Louisville, and Ruben Martínez Barricarte, Ph.D., Assistant Professor of Medicine at Vanderbilt University, shared recent advancements in immunogenetics. This article discusses their research on genetic variation in antibody loci, antibody production processes,...-
Channel: Articles
11-06-2024, 07:24 PM -
-
by seqadmin
Next-generation sequencing (NGS) and quantitative polymerase chain reaction (qPCR) are essential techniques for investigating the genome, transcriptome, and epigenome. In many cases, choosing the appropriate technique is straightforward, but in others, it can be more challenging to determine the most effective option. A simple distinction is that smaller, more focused projects are typically better suited for qPCR, while larger, more complex datasets benefit from NGS. However,...-
Channel: Articles
10-18-2024, 07:11 AM -
ad_right_rmr
Collapse
News
Collapse
Topics | Statistics | Last Post | ||
---|---|---|---|---|
Started by seqadmin, 11-08-2024, 11:09 AM
|
0 responses
59 views
0 likes
|
Last Post
by seqadmin
11-08-2024, 11:09 AM
|
||
Started by seqadmin, 11-08-2024, 06:13 AM
|
0 responses
38 views
0 likes
|
Last Post
by seqadmin
11-08-2024, 06:13 AM
|
||
Started by seqadmin, 11-01-2024, 06:09 AM
|
0 responses
35 views
0 likes
|
Last Post
by seqadmin
11-01-2024, 06:09 AM
|
||
New Model Aims to Explain Polygenic Diseases by Connecting Genomic Mutations and Regulatory Networks
by seqadmin
Started by seqadmin, 10-30-2024, 05:31 AM
|
0 responses
23 views
0 likes
|
Last Post
by seqadmin
10-30-2024, 05:31 AM
|
Comment