Unconfigured Ad

**dGho** · 03-12-2013, 06:40 AM

split files

I have split fastq files to run Tophat. From what I understand is that this is a fairly common practice. Here is a hypothetical example:

#split read 1 into smaller files after every 40,000,000 lines
split -l 40000000 wholefile_read1.fastq ;
#rename resulting files
mv xaa wholefile_read1_1.fastq
mv xab wholefile_read1_2.fastq
.
.
#split read 2 into smaller files after every 40,000,000 lines
split -l 40000000 wholefile_read2.fastq
#rename resulting files
mv xaa wholefile_read2_1.fastq
mv xab wholefile_read2_2.fastq
.
.
#align split files with tophat
tophat -o out_1 -G mm10.gtf mm10 wholefile_read1_1.fastq wholefile_read2_1.fastq
tophat -o out_2 -G mm10.gtf mm10 wholefile_read1_2.fastq wholefile_read2_2.fastq
.
.
#use samtools to put the bam files back together
Samtools merge out.bam out_1 out_2

**dGho** · 03-12-2013, 06:44 AM

I didn't answer your question

I guess I did not exactly answer your question though. I do not know if there is any difference in results when the files are split. I do know that my very experienced co-worker does it all the time. That does not necessarily help.

Topics	Statistics	Last Post
Whole-Genome Sequencing Traces Faroe Islands Ancestry to a North Atlantic Founder Population by SEQadmin2 Started by SEQadmin2, 06-17-2026, 06:09 AM	0 responses 36 views 0 reactions	Last Post by SEQadmin2 06-17-2026, 06:09 AM
Sequencing the Two-Toed Sloth Genome Reveals Jumping Genes Tied to Its Extreme Metabolism by SEQadmin2 Started by SEQadmin2, 06-09-2026, 11:58 AM	0 responses 99 views 0 reactions	Last Post by SEQadmin2 06-09-2026, 11:58 AM
A New Method Makes Hantavirus Genome Analysis Faster and More Accessible by SEQadmin2 Started by SEQadmin2, 06-05-2026, 10:09 AM	0 responses 120 views 0 reactions	Last Post by SEQadmin2 06-05-2026, 10:09 AM
A New Single-Cell Method Maps DNA-Protein Interactions by SEQadmin2 Started by SEQadmin2, 06-04-2026, 08:59 AM	0 responses 113 views 0 reactions	Last Post by SEQadmin2 06-04-2026, 08:59 AM

Unconfigured Ad

Split fastq files for tophat analysis

Comment

Comment

Latest Articles

ad_right_rmr

News