Header Leaderboard Ad

Collapse

splitting big paired fastq files

Collapse

Announcement

Collapse
No announcement yet.
X
 
  • Filter
  • Time
  • Show
Clear All
new posts

  • splitting big paired fastq files

    Hi there,
    I do my 'bioinformatic' work in CLC. Now I sit with many (30) large files with paired end reads (~10GB each direction) and my computer is stalling if I'd try to use all in a de novo assembly. Hence, I am looking for a tool to split the files in, say, 4.
    I am afraid I am not familiar with the linux world. So, I am lookiing for scripts (R preferably, or Perl) that would solve this?

    Thank you.
    jd

  • #2
    If you split your fastq, you aren't going to get a good assembly. You really want a computer with more memory, so it can handle the whole fatq.

    If you really need to split it, use unix built-in programs.

    Code:
    split -l 40000000 myfastq.fq
    should split it into separate files, each with 40,000,000 lines, or 10 million reads.

    Comment


    • #3
      Thank you for your prompt reply!
      There are 150-200 mill reads in each of the paired fastq files and I just expected that to be quite redundant.

      Comment

      Latest Articles

      Collapse

      • seqadmin
        How RNA-Seq is Transforming Cancer Studies
        by seqadmin



        Cancer research has been transformed through numerous molecular techniques, with RNA sequencing (RNA-seq) playing a crucial role in understanding the complexity of the disease. Maša Ivin, Ph.D., Scientific Writer at Lexogen, and Yvonne Goepel Ph.D., Product Manager at Lexogen, remarked that “The high-throughput nature of RNA-seq allows for rapid profiling and deep exploration of the transcriptome.” They emphasized its indispensable role in cancer research, aiding in biomarker...
        09-07-2023, 11:15 PM
      • seqadmin
        Methods for Investigating the Transcriptome
        by seqadmin




        Ribonucleic acid (RNA) represents a range of diverse molecules that play a crucial role in many cellular processes. From serving as a protein template to regulating genes, the complex processes involving RNA make it a focal point of study for many scientists. This article will spotlight various methods scientists have developed to investigate different RNA subtypes and the broader transcriptome.

        Whole Transcriptome RNA-seq
        Whole transcriptome sequencing...
        08-31-2023, 11:07 AM

      ad_right_rmr

      Collapse

      News

      Collapse

      Topics Statistics Last Post
      Started by seqadmin, Today, 07:42 AM
      0 responses
      10 views
      0 likes
      Last Post seqadmin  
      Started by seqadmin, 09-22-2023, 09:05 AM
      0 responses
      23 views
      0 likes
      Last Post seqadmin  
      Started by seqadmin, 09-21-2023, 06:18 AM
      0 responses
      16 views
      0 likes
      Last Post seqadmin  
      Started by seqadmin, 09-20-2023, 09:17 AM
      0 responses
      16 views
      0 likes
      Last Post seqadmin  
      Working...
      X