Thank you for your prompt reply!
There are 150-200 mill reads in each of the paired fastq files and I just expected that to be quite redundant.
Seqanswers Leaderboard Ad
Collapse
Announcement
Collapse
No announcement yet.
X
-
If you split your fastq, you aren't going to get a good assembly. You really want a computer with more memory, so it can handle the whole fatq.
If you really need to split it, use unix built-in programs.
Code:split -l 40000000 myfastq.fq
Leave a comment:
-
splitting big paired fastq files
Hi there,
I do my 'bioinformatic' work in CLC. Now I sit with many (30) large files with paired end reads (~10GB each direction) and my computer is stalling if I'd try to use all in a de novo assembly. Hence, I am looking for a tool to split the files in, say, 4.
I am afraid I am not familiar with the linux world. So, I am lookiing for scripts (R preferably, or Perl) that would solve this?
Thank you.
jd
Latest Articles
Collapse
-
by seqadmin
During the COVID-19 pandemic, scientists observed that while some individuals experienced severe illness when infected with SARS-CoV-2, others were barely affected. These disparities left researchers and clinicians wondering what causes the wide variations in response to viral infections and what role genetics plays.
Jean-Laurent Casanova, M.D., Ph.D., Professor at Rockefeller University, is a leading expert in this crossover between genetics and infectious...-
Channel: Articles
Yesterday, 10:59 AM -
-
by seqadmin
The first FDA-approved CRISPR-based therapy marked the transition of therapeutic gene editing from a dream to reality1. CRISPR technologies have streamlined gene editing, and CRISPR screens have become an important approach for identifying genes involved in disease processes2. This technique introduces targeted mutations across numerous genes, enabling large-scale identification of gene functions, interactions, and pathways3. Identifying the full range...-
Channel: Articles
08-27-2024, 04:44 AM -
ad_right_rmr
Collapse
News
Collapse
Topics | Statistics | Last Post | ||
---|---|---|---|---|
Started by seqadmin, 09-06-2024, 08:02 AM
|
0 responses
138 views
0 likes
|
Last Post
by seqadmin
09-06-2024, 08:02 AM
|
||
Started by seqadmin, 09-03-2024, 08:30 AM
|
0 responses
141 views
0 likes
|
Last Post
by seqadmin
09-03-2024, 08:30 AM
|
||
Started by seqadmin, 08-27-2024, 04:40 AM
|
0 responses
152 views
0 likes
|
Last Post
by seqadmin
08-27-2024, 04:40 AM
|
||
New Single-Molecule Sequencing Platform Introduces Advanced Features for High-Throughput Genomics
by seqadmin
Started by seqadmin, 08-22-2024, 05:00 AM
|
0 responses
396 views
0 likes
|
Last Post
by seqadmin
08-22-2024, 05:00 AM
|
Leave a comment: