From Algorithms to Assemblies: An Interview with Sequencing Analysis Experts—Part 6

Published: 04-13-2023, 06:35 AM
276 views
0 comments
- Share
- Tweet

From Algorithms to Assemblies: An Interview with Sequencing Analysis Experts—Part 6
Welcome to part six of our Q&A article series with leading sequencing analysis providers. We’re interviewing these experts to gain helpful insights into their complex analysis processes.

In this final installment of our series, we ask our participants about one of the most important aspects of data analysis, accuracy and reproducibility.

If you’re just joining us, we recommend reviewing the first installment on quality control, the second installment covering alignments and assemblies, the third installment on transcript analysis, the fourth installment on data visualization, and the fifth installment on the latest trends in sequencing analysis.

What steps do you take to ensure that your analysis and pipelines are accurate and reproducible?

Richard Moir, Director of Product and Technology, Geneious

The Geneious team prioritizes scientific accuracy above all else and we ensure this by bringing together the best commercial software engineering practices such as automated testing, continuous integration and peer reviews with ample scientific knowledge provided by experienced biologists that fill key roles such as product managers and quality advocates in the dev team. Our helpdesk is also staffed by PhD qualified molecular biologists to advise users on accurate use of our tools and interpretation of the results.

Reproducibility is also an important part of the Geneious way of working with a host of features that help in this respect:
New result documents are saved at each step of an analysis and the settings that were used are stored on each document for future reference, creating an audit trail for your analysis operations.

Result documents keep a reference to all input documents and vice-versa meaning inputs and outputs can be reliably tracked.

Analysis options can be saved as a preset and those presets can be shared with colleagues.

Workflows allow creation of standard operating procedures to allow easy reproduction of an analysis pipeline.

As a desktop tool, it is always possible to run previous versions of the software when necessary and we make all versions available for easy download from our website.

Dr. Ni Ming, Senior Vice-President, MGI

In terms of accuracy, as mentioned in a previous answer, QC preprocessing of data ensures that the analysis data is as accurate as possible. The use of T2T genome and comparison software with higher levels of accuracy further add to this, while AI algorithms help to continuously improve accuracy.

On the other hand, mutation detection software will perform random downsampling operations on high-depth data to a certain extent and employ random functions, etc. to generate unreproducible results. You can cancel downsampling, modify the use of random functions, etc. to ensure reproducibility.

Simon Valentine, Chief Commercial Officer, Basepair

The Basepair platform offers analysis workflows for a variety of genomic datatypes (RNA-seq, ChIP-seq, ATAC-seq, scRNA-seq, WGS/WES, etc.). We leverage industry-standard tools available in the public domain that have been cited in hundreds of peer-reviewed publications.

Our workflows are always validated by first processing a series of published datasets from different species and of different overall quality to ensure they consistently reproduce the expected results. In order to trust the final results of an analysis, not only must the data pass various QC checks along the way, but the pipeline itself must also be carefully evaluated.

QIAGEN Digital Insights Team

We at QIAGEN Digital Insights have a team of PhD-level bioinformaticians and developers who pride themselves on developing high-quality tools. Unlike some open-source tools that result from a graduate-student thesis or project, the tools we produce are fully supported and follow strict development methodologies.

We use robust software development processes under ISO27001 standards and routinely test our workflows against standard datasets to ensure high quality and reproducibility. QIAGEN CLC Genomics Workbench also provides an audit trail so you can always look back at the settings and parameters used for analysis for maximum reproducibility.

Mike Lelivelt, VP of Software Product Management and Marketing, Illumina

In DRAGEN, over 200 smoke tests automatically run every night, and over 3500 automated test cases run every weekend. This ensures we capture issues early. We use “golden” dataset (such as GIAB data) as the truth to evaluate our pipeline accuracy. In the automation test runs that occur every weekend, if there is any regression from a previous run, the team will take action to address it. This ensures no regression in our pipeline accuracy, and the accuracy trend is always upward.

Robustness tests run the same test multiple times and ensure they generate the same result. We also run the same test across different platforms (AWS, Azure, different DRAGEN servers, BaseSpace Sequence Hub, Illumina Connected Analytics etc.) and make sure the results are consistent. DRAGEN development follows the Illumina Quality Management System. IVD compliant DRAGEN NGS analysis applications can be used as a component of Illumina and customer IVD solutions.
Tags: None
Please sign into your account to post comments.

Advanced Sequencing Platforms Tackle Neuroscience’s Toughest Genomics Problems

by SEQadmin2

Genomics studies in neuroscience face a special challenge due to the brain’s complexity and scarcity of samples. Mapping changes in cell type and state using conventional next-generation sequencing methods remains challenging. Advances in technologies like single-cell sequencing, spatial transcriptomics, and long-read sequencing have opened the door to deeper studies of the brain and diseases like Alzheimer’s, amyotrophic lateral sclerosis (ALS), and schizophrenia.
...
- Channel: Articles
07-09-2026, 11:10 AM
Cancer Drug Resistance: The Lingering Barrier to Rising Survival

by SEQadmin2

Cancer survival rates have significantly increased in the last few decades in the United States, reaching a combined 70% 5-year survival rate by 2021. Behind this number, there are years of research to find new therapies, drug targets, and early detection methods. But there is one core challenge that keeps slowing down these advances, and it’s about drug resistance.

There is no single reason why many patients don’t respond to treatment as expected. Cancer is...
- Channel: Articles
07-08-2026, 05:17 AM
Nine Things a Sample Prep Scientist Thinks About Before Sequencing

by SEQadmin2

I’m not a sequencing expert. I’m a purification scientist who uses NGS to evaluate workflows my group develops. With this perspective, we think about the sample first and the NGS workflow second. The sequencer is an exceptionally honest reporter, but it can only report on what you give it, so whether you get clean, interpretable data from an NGS workflow is largely determined before you begin.

Here are nine questions we think about, in roughly the order they matter, before...
- Channel: Articles
06-18-2026, 07:11 AM

New Analysis Splits Leukemia Into 16 Epigenomic Subgroups

by SEQadmin2

Acute myeloid leukemia (AML) is one of the most aggressive of all blood cancers, and how it is classified helps determine how each patient is treated....
- Channel: News
07-09-2026, 10:04 AM
Genome-Wide CRISPR Screen Uncovers Unlikely Psoriasis Target

by SEQadmin2

Biohub researchers performed what they believe is the first genome-wide CRISPR study of primary human adult skin cells, then used an AI model to mine...
- Channel: News
07-08-2026, 10:08 AM
Engineered Protein Motor Takes Its First Steps Along DNA Track

by SEQadmin2

An international team led by Lund University and the University of New South Wales has built an artificial protein motor that takes controlled, directional...
- Channel: News
07-07-2026, 11:05 AM
High-Resolution Sequencing Exposes Hidden Toxoplasma Diversity

by SEQadmin2

An international research team has used high-resolution sequencing to reveal previously hidden genetic diversity in Toxoplasma gondii. The study, conducted...
- Channel: News
07-02-2026, 11:08 AM

Unconfigured Ad

From Algorithms to Assemblies: An Interview with Sequencing Analysis Experts—Part 6

From Algorithms to Assemblies: An Interview with Sequencing Analysis Experts—Part 6

About the Author

Latest Articles

ad_right_rmr

News