**nilshomer** · 07-12-2011, 06:31 AM

Take your read length and multiply it by the number of reads to get the total bases present in your dataset. So for 1M SE reads @ 50bp, you have 50Mb. For 1M PE @ 50bp x 50bp, you have 100Mb. If you look at only one PE file (reads1), you get 50Mb. Note that the bfast file will contain both ends of each pair in the same file, so that would be 100Mb.
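
A rough sketch of that arithmetic in Python, in case it helps. The function name `total_bases` and its parameters are made up for illustration and are not part of bfast:

```python
def total_bases(num_reads, read_length, paired=False):
    """Total bases = number of reads (or read pairs) x read length.

    For paired-end data, num_reads is the number of pairs and both
    ends are assumed to have the same length.
    """
    ends = 2 if paired else 1
    return num_reads * read_length * ends


print(total_bases(1_000_000, 50))               # 1M SE @ 50bp
print(total_bases(1_000_000, 50, paired=True))  # 1M PE @ 50bp x 50bp
```

Calling it with `paired=False` on the pair count also gives the size of a single PE file (reads1 alone).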

**arkal** · 07-12-2011, 06:51 AM

I'm sorry, I'm still a little confused...

The formula I'm using for N is

N = (Genome size x Coverage) / ( RL1 + RL2)

So if my Genome size is 100Mb, Coverage is 10 and RL1 = RL2 = 50 (PE)

N = 100,000,000 x 10 / 100 = 10,000,000 read pairs
i.e. *_10X_PE_1.fq = *_10X_PE_2.fq = 10,000,000 reads.

Now, if RL2 = 0, keeping coverage and genome size the same,
N = 100,000,000 x 10 / 50 = 20,000,000 reads (single-end)
i.e. *_10X_SE_1.fq = 20,000,000 reads and *_10X_SE_2.fq = 0 reads.

I hope I'm right so far.
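
The two cases above can be checked with a small Python sketch of the same formula. The name `read_count` is hypothetical, and the integer division (rounding down when the numbers don't divide evenly) is an assumption on my part:

```python
def read_count(genome_size, coverage, rl1, rl2=0):
    """N = (genome size x coverage) / (RL1 + RL2).

    rl2=0 models single-end data, so N is then a number of reads
    rather than a number of read pairs.
    """
    return genome_size * coverage // (rl1 + rl2)


pe_pairs = read_count(100_000_000, 10, 50, 50)  # 10X paired-end, 50bp x 50bp
se_reads = read_count(100_000_000, 10, 50)      # 10X single-end, 50bp
print(pe_pairs, se_reads)
```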

Furthermore, if I have already generated 20X coverage PE data for the same genome,
N = 100,000,000 x 20 / 100 = 20,000,000 read pairs
i.e. *_20X_PE_1.fq = *_20X_PE_2.fq = 20,000,000 reads.

Is it safe to assume that either *_20X_PE_1.fq OR *_20X_PE_2.fq can be used as a substitute for *_10X_SE_1.fq, since each has the same number of reads?

**nilshomer** · 07-12-2011, 07:24 AM

The answer is yes.
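
For anyone who wants to check the numbers, here is a minimal Python sketch that turns the formula around and computes the coverage contributed by one file of the 20X PE set. The names `coverage_of_file`, `GENOME_SIZE` and `READ_LENGTH` are illustrative only, and it checks nothing beyond read count and coverage:

```python
GENOME_SIZE = 100_000_000  # 100Mb genome from the example
READ_LENGTH = 50           # RL1 = RL2 = 50


def coverage_of_file(num_reads, read_length, genome_size):
    """Coverage = (number of reads x read length) / genome size."""
    return num_reads * read_length / genome_size


# Number of read pairs in the 20X PE dataset (each pair covers 2 x 50 bases).
pairs_20x = GENOME_SIZE * 20 // (2 * READ_LENGTH)

# One file of the pair holds one 50bp read per pair.
single_file_cov = coverage_of_file(pairs_20x, READ_LENGTH, GENOME_SIZE)
print(pairs_20x, single_file_cov)
```

One mate file of the 20X PE set holds the same number of 50bp reads as the 10X SE set, so it works out to the same coverage.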

**arkal** · 07-12-2011, 08:59 AM

Thanks a lot
