**kmcarr** · 04-02-2009, 04:47 AM

The *_seq.txt files in the Bustard directory include the sequence of every cluster (read). The *_sequence.txt files in the GERALD directory include only the reads that passed the filter.

**lajoieb** · 04-02-2009, 05:00 AM

Ah ok!
That makes perfect sense then.

Appreciate the explanation.

bryan

**Torst** · 04-08-2009, 05:52 PM

Originally posted by lajoieb:

wc -l s_*_sequence.txt (fastq)

To count the number of reads in a FASTQ file, you can use grep in -c (counting) mode:

Code:

grep -c '^+' *_sequence.txt

s_6_sequence.txt:4525658
s_7_sequence.txt:4485601
s_8_sequence.txt:4099309

Note that you can't match on '@' because it is a valid quality character (Q=0) and can therefore start a quality line, whereas '+' is not used as a quality character in Illumina FASTQ. This is easier than dividing by four in your head, although the shell can help us there too:

Code:

echo $(( $(wc -l < s_6_sequence.txt) / 4 ))

4525658
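
The two approaches above can be wrapped into a small helper. This is only a sketch, not something from the original post: the function name count_fastq_reads is made up for illustration, and it assumes every record occupies exactly four lines (no wrapped sequence or quality lines). The toy file at the end shows why counting '@' lines overcounts when a quality line happens to start with '@'.

```shell
#!/bin/sh
# Hypothetical helper (not from the thread): print "file<TAB>reads" for each
# FASTQ file given, assuming exactly four lines per record.
count_fastq_reads() {
    for f in "$@"; do
        if [ ! -r "$f" ]; then
            echo "error: cannot read $f" >&2
            return 1
        fi
        # Arithmetic expansion also drops the padding some wc versions emit
        lines=$(( $(wc -l < "$f") ))
        if [ $(( lines % 4 )) -ne 0 ]; then
            echo "warning: $f has $lines lines, not a multiple of 4" >&2
        fi
        printf '%s\t%d\n' "$f" $(( lines / 4 ))
    done
}

# Toy two-read file whose second quality line starts with '@'
tmp=$(mktemp)
printf '%s\n' '@read1' 'ACGT' '+read1' 'hhhh' \
              '@read2' 'ACGT' '+read2' '@hhh' > "$tmp"
count_fastq_reads "$tmp"   # reports 2 reads
grep -c '^@' "$tmp"        # prints 3: the '@hhh' quality line is counted too
grep -c '^+' "$tmp"        # prints 2
rm -f "$tmp"
```

On real data the call would be something like count_fastq_reads s_*_sequence.txt, and the warning on stderr flags any file that is truncated or not in four-line form.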

Enjoy

solexa output files | s_*_seq.txt vs. s_*_sequence.txt