Seqanswers Leaderboard Ad

**dpryan** · 08-21-2013, 07:23 AM

Head isn't ideal since the reads near the beginning of the fastq file tend to be crappier. So, you really want to randomly subsample from the whole fastq file. You should be able to adapt the scripts found here and elsewhere to your case of having the pairs in the same file.

**swbarnes2** · 08-21-2013, 09:20 AM

The first tile in your fastq is going to come from the edge of the flow cell, so won't be as good as tiles in the middle. Use grep | head to get the first 1000 reads from a tile in the middle; that would be better. The tile ID should be the number after the lane.

Topics	Statistics	Last Post
ASHG 2024 Highlights – Part Two by seqadmin Started by seqadmin, Today, 11:09 AM	0 responses 23 views 0 likes	Last Post by seqadmin Today, 11:09 AM
ASHG 2024 Highlights – Part One by seqadmin Started by seqadmin, Today, 06:13 AM	0 responses 20 views 0 likes	Last Post by seqadmin Today, 06:13 AM
Seq-Scope Expands Possibilities for High-Resolution Gene Expression Analysis by seqadmin Started by seqadmin, 11-01-2024, 06:09 AM	0 responses 30 views 0 likes	Last Post by seqadmin 11-01-2024, 06:09 AM
New Model Aims to Explain Polygenic Diseases by Connecting Genomic Mutations and Regulatory Networks by seqadmin Started by seqadmin, 10-30-2024, 05:31 AM	0 responses 21 views 0 likes	Last Post by seqadmin 10-30-2024, 05:31 AM

Seqanswers Leaderboard Ad

Announcement

Subsampling from one paired-end fastq file

Comment

Comment

Latest Articles

ad_right_rmr

News