Seqanswers Leaderboard Ad

**chadn737** · 03-08-2013, 07:12 AM

1) ~3-3.5 GB output per sample
2) You are on the low end. If your goal is differential expression, then GBs is not really that informative, because what you are interested in is the number of reads/fragments. For differential expression, a single accurately mapped 50bp read gives about as much information as a 100bp paired end reads. Now if you are also looking for alternative splicing, then the added bps help. I am assuming these are paired end, 100bp data? If so, then the number of paired end reads you will have will be ~3GBs/200bps = ~15,000,000 reads per sample. Yes, its possible to detect differential expression at that level. But to give you an idea, I would say ~10,000,000 to be the lower limit for Arabidopsis. Also, not all reads will map or map correctly, so that you should expect to lose data. If you can get ~15,000,000 reads for 8 samples on a single lane, then I would go with two lanes of data, get ~30,000,000 which will give you a much better representation. Your ability to detect differential expression accurately is very much dependent upon read counts and if you take the minimal number of reads, then lower expressed genes will be a problem.
3) I don't like calculating coverage for RNA-seq. Each gene is expressed differently and how does one make sense of coverage from such data? For this to make sense, then you need to know a priori how many copies of each RNA you have.....in which case there is no need to do an experiment.

Also, make sure you have biological replicates. It is pointless if you do not have biological replicates.

**ingenerito** · 03-08-2013, 08:20 AM

Thanks for your response so quickly!!

Originally posted by chadn737 View Post

I am assuming these are paired end, 100bp data? If so, then the number of paired end reads you will have will be ~3GBs/200bps = ~15,000,000 reads per sample. Yes, its possible to detect differential expression at that level. But to give you an idea, I would say ~10,000,000 to be the lower limit for Arabidopsis. Also, not all reads will map or map correctly, so that you should expect to lose data. If you can get ~15,000,000 reads for 8 samples on a single lane, then I would go with two lanes of data, get ~30,000,000 which will give you a much better representation.

Yes are paired end 100 bp data, really are 4 samples with a biological replicates
an schema is:
RNA from:
plant A: 2 one grape cluster in 2 different times
plant B: 2 one grape cluster in 2 different times
each sample separately, like follow:

PLANT-TIME-CLUSTER
A-T1-C1
A-T1-C2
A-T2-C1
A-T2-C2
B-T1-C1
B-T1-C2
B-T2-C1
B-T2-C2

so, all "C2" are the biological replicates. Thanks again!

**chadn737** · 03-08-2013, 08:43 AM

Its good you have Biological reps. I just checked and the number of grape genes ~30,000 is not much more than Arabidopsis and a lot less than some of the other species I have worked with. You could get away with ~15,000,000 reads, but from experience, getting that extra lane of data and increased depth of sequencing makes a huge difference. So I would really encourage you to use at least 2 lanes of data.

Topics	Statistics	Last Post
Bacterial Timeline Study Suggests Oxygen Use Preceded Photosynthesis by seqadmin Started by seqadmin, Today, 12:59 PM	0 responses 6 views 0 reactions	Last Post by seqadmin Today, 12:59 PM
New Software Simplifies 3D Gene Expression Mapping by seqadmin Started by seqadmin, Yesterday, 10:17 AM	0 responses 8 views 0 reactions	Last Post by seqadmin Yesterday, 10:17 AM
AI Tool Creates High-Resolution 3D Maps of the Mouse Brain by seqadmin Started by seqadmin, 03-20-2025, 05:03 AM	0 responses 49 views 0 reactions	Last Post by seqadmin 03-20-2025, 05:03 AM
Studying Microbial Gene Transfer with RNA Barcoding by seqadmin Started by seqadmin, 03-19-2025, 07:27 AM	0 responses 60 views 0 reactions	Last Post by seqadmin 03-19-2025, 07:27 AM

Seqanswers Leaderboard Ad

RNA-Seq throughput

Comment

Comment

Comment

Latest Articles

ad_right_rmr

News