**SnapSeq** · 04-16-2024, 06:45 AM

Which version of HiSeq did you use previously? The HiSeq 4000, NovaSeq, and NextSeq 2000 all use a newer clustering chemistry known as Exclusion Amplification (ExAmp in most Illumina docs): a template seeds a microwell on the flow cell and is amplified immediately, occupying the well before other templates can seed. MiSeqs, NextSeq 500s, and HiSeq 2000/2500s use random seeding, which doesn't favor specific fragment sizes. The rapid seeding during ExAmp favors short fragments, which seed first. If you map read positions on the flow cell, you should see that the insert length at the start of the lane is shorter than at the end of the lane.
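One way to check this on your own data is to group insert sizes by tile. Below is a minimal sketch, assuming the reads keep standard Illumina names (`instrument:run:flowcell:lane:tile:x:y`) and that you feed it SAM text lines; the function name `median_insert_by_tile` is made up for illustration. How tile numbers map to a physical position along the lane depends on the instrument, so that step is left out.

```python
from collections import defaultdict
from statistics import median


def median_insert_by_tile(sam_lines):
    """Return {(lane, tile): median TLEN} for properly paired primary reads.

    Assumes QNAME follows the instrument:run:flowcell:lane:tile:x:y layout.
    """
    sizes = defaultdict(list)
    for line in sam_lines:
        if line.startswith("@"):  # skip SAM header lines
            continue
        fields = line.rstrip("\n").split("\t")
        if len(fields) < 11:  # not a complete alignment record
            continue
        flag = int(fields[1])
        tlen = int(fields[8])
        # Keep proper pairs (0x2), drop secondary/supplementary (0x100/0x800),
        # and count each pair once via the mate with positive TLEN.
        if not flag & 0x2 or flag & 0x900 or tlen <= 0:
            continue
        name = fields[0].split(":")
        if len(name) < 7:  # read name does not follow the assumed layout
            continue
        lane, tile = int(name[3]), int(name[4])
        sizes[(lane, tile)].append(tlen)
    return {key: median(vals) for key, vals in sorted(sizes.items())}
```

You could pipe `samtools view` output into it through `sys.stdin` and then plot the medians against tile number.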

**jeanlain** · 04-16-2024, 07:03 AM

Thanks for the reply. I think we were using HiSeq 2500.

**SnapSeq** · 04-16-2024, 09:14 AM

This is an old article, but the author thoroughly explains some of the drawbacks of ExAmp, including the short fragment bias.

(almost) everything you wanted to know about @illumina HiSeq 4000...and some stuff you didn't

https://core-genomics.blogspot.com/2016/01/almost-everything-you-wanted-to-know.html

The HiSeq 4000 was Illumina's way of making the patterned flowcell technology available to non X Ten customers, and opening up patterned ...

**jeanlain** · 04-16-2024, 09:32 AM

Thanks.
As you can see from my first post, the bias towards shorter fragments is very strong. Is it always that strong? I don't see many people complaining about it, but it's a big problem if half of your sequence is duplicated because the mates overlap.

**SnapSeq** · 04-18-2024, 11:31 AM

I don't know that anyone has measured how extreme the bias is for library fragments. If I remember correctly, Illumina published some rough numbers early on stating that adapter dimers (much shorter than a library fragment) could take up 5-10x more of your reads on a patterned flow cell than on a non-patterned one, but I don't know how to find where I read that initially.

The closest I've found is point 3 on this post about HiSeq 4000 services, which says that 1% dimer can translate to 6% of reads, and 10% dimer up to 84% of reads.

HiSeq 3000 / HiSeq 4000 Services

https://genohub.com/services/sequencing/illumina-hiseq-3000-4000/

Illumina HiSeq 3000 HiSeq 4000 instrument: considerations, limitations and service prices.
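To put the dimer figures quoted above on a common scale, here is a rough sketch that converts them into an implied odds ratio. It assumes a simple competitive model in which each short molecule is `weight` times as likely to end up as a cluster as a library fragment; that model and both function names are my own illustration, not something Illumina or the linked page states.

```python
def implied_odds_ratio(input_fraction, read_fraction):
    """Odds ratio by which a species is enriched between the input and the reads."""
    input_odds = input_fraction / (1.0 - input_fraction)
    read_odds = read_fraction / (1.0 - read_fraction)
    return read_odds / input_odds


def expected_read_fraction(input_fraction, weight):
    """Read fraction if each short molecule is `weight` times as likely to seed."""
    weighted = weight * input_fraction
    return weighted / (weighted + (1.0 - input_fraction))


# Figures quoted in the post: 1% dimer -> 6% of reads, 10% dimer -> 84% of reads.
print(round(implied_odds_ratio(0.01, 0.06), 2))  # 6.32
print(round(implied_odds_ratio(0.10, 0.84), 2))  # 47.25
```

Under this model the two quoted points imply quite different advantages (about 6x and about 47x), so a single constant weight doesn't fit both. Treat the numbers as rough orders of magnitude rather than a calibrated bias.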

My recommendation would be to fragment gDNA less to create larger inserts, or if you're performing a double-sided cleanup at the end of library prep to generate the profile in the electropherogram you posted above, adjust your ratios to eliminate more short fragments and shift the distribution to the right. If short fragments aren't present, the bias won't allow them to be over-represented.

**jeanlain** · 04-20-2024, 01:12 AM

Thanks for the recommendation. We don't prepare the libraries ourselves, we just send genomic DNA to sequencing platforms. We may ask to maximize insert size.
An analysis of the selection bias would be worth publishing, as the problem may be important. It can greatly reduce the amount of useful sequence data you obtain. Not only can the number of distinct bases sequenced be much less than 2x150 per read pair, but the pair is useless if it has a mapping quality of zero because the effective sequence is too short and maps to several locations with equal score. In the end, you may lose a lot of data.
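The loss from mate overlap is easy to put numbers on. A minimal sketch, assuming 2x150 reads, adapter read-through already trimmed, and no soft-clipping (`pair_coverage` is a made-up helper name):

```python
def pair_coverage(insert_len, read_len=150):
    """Return (distinct bases covered, bases sequenced twice) for one read pair."""
    covered = min(insert_len, 2 * read_len)      # the pair cannot cover more than the insert
    sequenced = 2 * min(insert_len, read_len)    # bases actually read from the insert
    overlap = sequenced - covered                # bases read by both mates
    return covered, overlap


for insert in (100, 200, 400):
    print(insert, pair_coverage(insert))
# 100 (100, 100)
# 200 (200, 100)
# 400 (300, 0)
```

So with a 200 bp insert, a third of the 300 sequenced bases are redundant, and with a 100 bp insert both mates read the same 100 bases.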

**asd** · 04-24-2024, 06:24 AM

Dear all, which sequencing platform would you recommend for isolated pathogenic bacteria: the Illumina NovaSeq 6000 or the Illumina NextSeq 550? We intend to explore virulence genes and resistance genes, as well as SNPs and other variants.

Novaseq 600 very low insert size
