Quality Control Essentials for Next-Generation Sequencing Workflows

Published: 02-10-2025, 01:58 PM
644 views
0 comments
- Share
- Tweet

Quality Control Essentials for Next-Generation Sequencing Workflows
Like all molecular biology applications, next-generation sequencing (NGS) workflows require diligent quality control (QC) measures to ensure accurate and reproducible results. Proper QC begins at nucleic acid extraction and continues all the way through to data analysis. This article outlines the key QC steps in an NGS workflow, along with the commonly used tools and techniques.

Nucleic Acid Quality Control
Preparing for NGS starts with isolating the target nucleic acids. Once extracted, the DNA or RNA must be assessed to confirm its suitability for library construction. For some applications, certain levels of impurities may be tolerated. Additionally, when samples are limited or valuable (e.g., FFPE or clinical samples), researchers may need to utilize their extractions regardless of quality concerns. However, for most NGS workflows, proper QC of isolated nucleic acids is essential to ensure sufficient concentration, purity, and integrity for reliable library preparation and downstream sequencing.

During this stage, researchers should first assess sample concentration to confirm extraction success and ensure compatibility with the concentration range required for library preparation. This is commonly done using spectrophotometric methods or fluorescent dyes. Spectrophotometers offer a convenient way to measure concentration and detect impurities, though accuracy can be reduced by contaminants like proteins. Fluorometric instruments provide greater accuracy in quantification but are unable to detect impurities.

The next step is to assess nucleic acid size and fragment distribution. While gel electrophoresis can be used for this purpose, microfluidic capillary systems such as Agilent’s TapeStation and Femto Pulse or QIAGEN’s QIAxcel are now preferred for their speed, accuracy, and high throughput. Microfluidic capillary electrophoresis also gives a fair estimate of nucleic acid concentration, as well as the detection of impurities like degradation or unwanted DNA contamination for RNA samples.

In cases where there are impurities, fragmentation, or a low concentration of nucleic acid, researchers can utilize several tools to improve their yields and clean up their samples. This is most often done using commercialized spin columns or magnet beads (e.g., AMPure XP beads) that capture the nucleic acid of interest and wash away impurities or nucleic acids outside of the desired size range. Other tools include size selection instruments for target enrichment of nucleic acids like Ranger® Technology from YourGene Health or Pippin HT from Sage Science.

Once the target nucleic acids have been isolated and purified, the next step is library preparations.

Post-Library Prep QC
Preparing an NGS library involves converting the target nucleic acids into a compatible form for the respective sequencing platform. While some QC steps may be included during longer library preparation workflows, most QC is performed afterward to ensure that the libraries are properly constructed and sequencing-ready. The type of QC required at this stage varies depending on the sequencing platform, but it typically includes verifying library concentration, fragment size distribution, and purity, as well as detecting any contaminants that could interfere with sequencing.

In general, the same QC tools and techniques used after nucleic acid extraction are also applied post-library prep. However, there is a key difference when it comes to measuring concentration: at this stage, researchers may opt for qPCR to obtain more accurate quantification. These qPCR assays provide precise concentration measurements by using specific primers that bind to adapter regions unique to functional library molecules. This ensures that the sequencer is not over- or underloaded, which can affect the quality and quantity of data.

A common concern during this stage is the presence of adapter dimers and other unwanted byproducts from library preparation, especially in workflows involving amplification or adapter ligation. If residual adapters or primer dimers are detected, additional cleanup steps, such as AMPure XP bead purification or size selection techniques (e.g., Sage Science’s Pippin Prep), can be employed to remove them.

The final QC step at this stage is ensuring that libraries are normalized before sequencing. Sequencing platforms like Illumina require libraries to be pooled at specific molar concentrations to achieve balanced sequencing coverage across samples. This normalization can be done through manual dilution or by using bead-based normalization methods or enzymatic approaches designed to equalize input concentrations before sequencing.

Post-Sequencing QC
QC doesn’t stop after sequencing. In fact, one of the most critical QC steps is evaluating the raw sequence data to identify potential issues before starting the analysis. This process begins with assessing the raw data, and FastQC is one of the most popular tools for this purpose¹. It provides important metrics such as base quality scores, GC content, overrepresented sequences, and sequence duplication levels. Another valuable tool is MultiQC, which aggregates QC reports from multiple sources (including FastQC) into a single, comprehensive summary². While MultiQC does not perform the analysis itself, it is particularly useful for saving time by compiling QC reports and visualizing trends across multiple datasets or samples.

After the initial assessment, the next step is trimming low-quality based and removing any adapter sequences. This improves overall read quality and prevents adapter contamination in downstream analysis. Trimming can be performed using tools like Trimmomatic, Skewer, Cutadapt, and Fastp, which can also provide quality profiling^3,4,5,6.

With the expansion of long-read sequencing platforms, several QC tools have been developed specifically for long-read data. Oxford Nanopore Technologies (ONT)-specific tools include PycoQC and Porechop^7,8. PycoQC computes metrics and generates interactive QC plots for ONT data, while Porechop is used for adapter trimming and quality filtering, though these tools are no longer supported.

For broader long-read QC needs, NanoPack provides visualization and processing tools for ONT and PacBio long-read data, while Filtlong improves QC by filtering low-quality, adapter-contaminated, or off-length reads^9,10. LongQC provides QC for both PacBio and ONT long reads, offering sample QC to assess data readiness and platform QC for sequencing performance evaluation¹¹. Finally, LongReadSum is a multi-threaded QC tool that delivers fast, comprehensive metrics and basecalling signal analysis for long-read sequencing data across major platforms, including ONT, PacBio, and Illumina Complete Long Reads¹².

References
Andrews, S. (2010). FastQC: A quality control tool for high throughput sequence data [Online]. Retrieved from http://www.bioinformatics.babraham.a...ojects/fastqc/

Ewels, P., Magnusson, M., Lundin, S., & Käller, M. (2016). MultiQC: Summarize analysis results for multiple tools and samples in a single report. Bioinformatics, 32(19), 3047–3048. https://doi.org/10.1093/bioinformatics/btw354

Bolger, A. M., Lohse, M., & Usadel, B. (2014). Trimmomatic: A flexible trimmer for Illumina sequence data. Bioinformatics, 30(15), 2114–2120. https://doi.org/10.1093/bioinformatics/btu170

Jiang, H., Lei, R., & Ding, S. W. (2014). Skewer: A fast and accurate adapter trimmer for next-generation sequencing paired-end reads. BMC Bioinformatics, 15, 182. https://doi.org/10.1186/1471-2105-15-182

Martin, M. (2011). Cutadapt removes adapter sequences from high-throughput sequencing reads. EMBnet Journal, 17(1), 10–12.

Chen, S., Zhou, Y., Chen, Y., & Gu, J. (2018). fastp: An ultra-fast all-in-one FASTQ preprocessor. Bioinformatics, 34(17), i884–i890. https://doi.org/10.1093/bioinformatics/bty560

Leger, A., & Leonardi, T. (2019). pycoQC, interactive quality control for Oxford Nanopore sequencing. Journal of Open Source Software, 4(34), 1236. https://doi.org/10.21105/joss.01236

Wick, R. R., Judd, L. M., Gorrie, C. L., & Holt, K. E. (2017). Completing bacterial genome assemblies with multiplex MinION sequencing. Microbial Genomics, 3(10), e000132. https://doi.org/10.1099/mgen.0.000132

De Coster, W., D’Hert, S., Schultz, D. T., Cruts, M., & Van Broeckhoven, C. (2018). NanoPack: Visualizing and processing long-read sequencing data. Bioinformatics, 34(15), 2666–2669. https://doi.org/10.1093/bioinformatics/bty149

Wick, R. R. (2018). Filtlong [Internet]. GitHub. Retrieved from https://github.com/rrwick/Filtlong
Tags: None
Please sign into your account to post comments.

Nine Things a Sample Prep Scientist Thinks About Before Sequencing

by SEQadmin2

I’m not a sequencing expert. I’m a purification scientist who uses NGS to evaluate workflows my group develops. With this perspective, we think about the sample first and the NGS workflow second. The sequencer is an exceptionally honest reporter, but it can only report on what you give it, so whether you get clean, interpretable data from an NGS workflow is largely determined before you begin.

Here are nine questions we think about, in roughly the order they matter, before...
- Channel: Articles
06-18-2026, 07:11 AM
From Collection to Sequencing: Why Sample Preparation and Preservation Define Sequencing Data

by SEQadmin2

Data variability is still an issue in sequencing technologies despite the advances in reproducibility and accuracy of these platforms. But the problem does not originate in the sequencing itself, but in the previous steps, before the sample reaches the sequencer.

The first step is collection, followed by preservation and sample preparation for analysis. Most scientists overlook those steps, but not being careful might just be skewing the experiment’s results.
...
- Channel: Articles
06-02-2026, 10:05 AM
Single-Cell Sequencing at an Inflection Point: Early Impacts of New Platforms and Emerging Trends

by SEQadmin2

With the launch of new single-cell sequencing platforms in 2026, the field stands at an exciting inflection point. This article surveys the most impactful advances in the field and discusses how they’re reshaping research in cancer, immunology, and beyond.

Introduction

Single-cell sequencing technologies have undergone remarkable advances over the past decade, transitioning from low-throughput experimental approaches to highly scalable platforms capable of...
- Channel: Articles
05-22-2026, 06:42 AM

Whole-Genome Sequencing Traces Faroe Islands Ancestry to a North Atlantic Founder Population

by SEQadmin2

Whole-genome sequencing of 40 individuals from the Faroe Islands has shed new light on how this remote North Atlantic population descended from an ancient...
- Channel: News
06-17-2026, 06:09 AM
Sequencing the Two-Toed Sloth Genome Reveals Jumping Genes Tied to Its Extreme Metabolism

by SEQadmin2

Sloths are the slowest mammals on Earth, and their dense jungle habitat has made them notoriously difficult to study. Now, for the first time, scientists...
- Channel: News
06-09-2026, 11:58 AM
A New Method Makes Hantavirus Genome Analysis Faster and More Accessible

by SEQadmin2

Hantavirus infections are rare—roughly 30 people are infected in the United States each year—but they are deadly, killing 30 to 40 percent of those...
- Channel: News
06-05-2026, 10:09 AM
A New Single-Cell Method Maps DNA-Protein Interactions

by SEQadmin2

Scientists at Weill Cornell Medicine and the New York Genome Center have developed a new method that maps, in single cells, the DNA binding sites of transcription...
- Channel: News
06-04-2026, 08:59 AM

Unconfigured Ad

Quality Control Essentials for Next-Generation Sequencing Workflows

Quality Control Essentials for Next-Generation Sequencing Workflows

About the Author

Latest Articles

ad_right_rmr

News