Take your read length, and multiply it by the number of bases to get the total bases present in your dataset. So for a 1M SE @ 50bp, you have 50Mb. For 1M PE @50bpx50bp, you have 100Mb. If you look at one PE file (reads1), then you get 50Mb. Note, that the bfast file will contain both PE in the same file, so that would be 100Mb.
Unconfigured Ad
Collapse
X
-
I'm sorry im still a little confused...Originally posted by nilshomer View PostTake your read length, and multiply it by the number of bases to get the total bases present in your dataset. So for a 1M SE @ 50bp, you have 50Mb. For 1M PE @50bpx50bp, you have 100Mb. If you look at one PE file (reads1), then you get 50Mb. Note, that the bfast file will contain both PE in the same file, so that would be 100Mb.
The formula i'm using for N is
N = (Genome size x Coverage) / ( RL1 + RL2)
So if my Genome size is 100Mb, Coverage is 10 and RL1 = RL2 = 50 (PE)
N = 100,000,000 x 10 / 100 = 10,000,000 read pairs
i.e *_10X_PE_1.fq = *_10x_PE_2.fq = 10,000,000 reads.
Now, if RL2=0, keeping coverage and genome size the same,
N = 100,000,000 x 10 / 50 = 20,000,000 read pairs or reads
i.e *_10X_SE_.fq1 = 20,000,000 reads and *_10X_SE_2.fq = 0 reads.
I Hope i'm right till here.
Furthermore, if i have already generated 20X coverage PE for the same genome,
N = 100,000,000 x 20 / 100 = 20,000,000 read pairs
i.e *_20X_PE_1.fq = *_20X_PE_2.fq = 20,000,000 reads.
Is it safe to assume that
either *_20X_PE_1.fq OR *_20X_PE_2.fq can be used as a substitute for *_10X_SE_.fq1 as both have the same number of reads?
Comment
-
Latest Articles
Collapse
-
by GATTACATLove this - good data definitely starts from good input, and poor input can only give relatively poor data. I particularly like the mention of Nanodrop/absorbance based methods for quantification. It's such a toss up if you'll get an accurate reading or what amounts to a randomly generated number, and a lot of library/sequencing related issues can be traced back to poor quant.
-
Channel: Articles
Yesterday, 11:43 AM -
-
by SEQadmin2
I’m not a sequencing expert. I’m a purification scientist who uses NGS to evaluate workflows my group develops. With this perspective, we think about the sample first and the NGS workflow second. The sequencer is an exceptionally honest reporter, but it can only report on what you give it, so whether you get clean, interpretable data from an NGS workflow is largely determined before you begin.
Here are nine questions we think about, in roughly the order they matter, before...-
Channel: Articles
-
ad_right_rmr
Collapse
News
Collapse
| Topics | Statistics | Last Post | ||
|---|---|---|---|---|
|
Started by SEQadmin2, Today, 11:08 AM
|
0 responses
6 views
0 reactions
|
Last Post
by SEQadmin2
Today, 11:08 AM
|
||
|
Started by SEQadmin2, 06-30-2026, 05:37 AM
|
0 responses
11 views
0 reactions
|
Last Post
by SEQadmin2
06-30-2026, 05:37 AM
|
||
|
Started by SEQadmin2, 06-26-2026, 11:10 AM
|
0 responses
19 views
0 reactions
|
Last Post
by SEQadmin2
06-26-2026, 11:10 AM
|
||
|
Whole-Genome Sequencing Traces Faroe Islands Ancestry to a North Atlantic Founder Population
by SEQadmin2
Started by SEQadmin2, 06-17-2026, 06:09 AM
|
0 responses
53 views
0 reactions
|
Last Post
by SEQadmin2
06-17-2026, 06:09 AM
|
Comment