BGZF for fastq files

rulix

Junior Member

Join Date: Oct 2017

Posts: 2
- Share
- Tweet
#1

BGZF for fastq files

10-09-2017, 08:13 AM

Hi,

I have been working with very large fastq files and was thinking of ways that might speed the operations that I regularly perform on them: counting the number of reads, finding a specific sequence or a particular header.

I remembered looking at the Illumina documentation for the bcl2fastq2 utility and remembered that it now allows generation of fastq files with BGZF compression. It got me thinking that the bloc access option would allow multithreaded decompression of the fastqs and therefore susbtantially speed up those operations that I regularly must perform on all of our generated fastqs.

Does anybody know if there is any tool to do this? I have done a search but can't find anything that seems to fit the picture.

Thanks

--Raul
Tags: None
GenoMax

Senior Member

Join Date: Feb 2008

Posts: 7140
- Share
- Tweet
#2

10-09-2017, 08:47 AM

@Raul: I moved your question to a new thread to give it added visibility. BBMap suite is multi-threaded and can use pigz (parallel gzip) library and varying levels of compression.
Comment

Previous template Next

Topics	Statistics	Last Post
Cancer Metastasis: A Deep Dive into Cellular Plasticity by seqadmin Started by seqadmin, 04-11-2024, 12:08 PM	0 responses 25 views 0 likes	Last Post by seqadmin 04-11-2024, 12:08 PM
Proteogenomic Profiles Offer New Clues in Prostate Cancer by seqadmin Started by seqadmin, 04-10-2024, 10:19 PM	0 responses 28 views 0 likes	Last Post by seqadmin 04-10-2024, 10:19 PM
Novel Diagnostic Assay Enhances Ovarian Cancer Detection by seqadmin Started by seqadmin, 04-10-2024, 09:21 AM	0 responses 24 views 0 likes	Last Post by seqadmin 04-10-2024, 09:21 AM
Evolutionary Dynamics of Centromeres: A Comparative Genomic Analysis by seqadmin Started by seqadmin, 04-04-2024, 09:00 AM	0 responses 52 views 0 likes	Last Post by seqadmin 04-04-2024, 09:00 AM

Seqanswers Leaderboard Ad