In the ENCODE RNA-seq guidelines it states:
"30M pair-end reads of length > 30NT, of which 20-25M are mappable to the genome or known transcriptome"
and
"For experiments from a typical mammalian tissue or in which sensitivity of detection is important, a minimum depth of 100-200 M 2 x 76 bp or longer reads is currently recommended. "
My question: The 30 million they refer to is counting both reads in a pair individually or do they count one pair as one read?
So, say I decide on a read length of 100. The amount of data I will require(or ask from a provider) is:
(A) 100(read length) X 30,000,000 = 3Gb
Or if I count each pair as one i.e 30 million pairs
(B) 100(read length) X 2 (for each pair) X 30,000,000 = 6 Gb
Should it be (A) or (B)?
PS: I know the question is basic and I tried to do a search but couldn't find a definite answer.
"30M pair-end reads of length > 30NT, of which 20-25M are mappable to the genome or known transcriptome"
and
"For experiments from a typical mammalian tissue or in which sensitivity of detection is important, a minimum depth of 100-200 M 2 x 76 bp or longer reads is currently recommended. "
My question: The 30 million they refer to is counting both reads in a pair individually or do they count one pair as one read?
So, say I decide on a read length of 100. The amount of data I will require(or ask from a provider) is:
(A) 100(read length) X 30,000,000 = 3Gb
Or if I count each pair as one i.e 30 million pairs
(B) 100(read length) X 2 (for each pair) X 30,000,000 = 6 Gb
Should it be (A) or (B)?
PS: I know the question is basic and I tried to do a search but couldn't find a definite answer.
Comment