**GenoMax** · 01-31-2014, 08:25 AM

Have a look at this paper: http://seqanswers.com/forums/showthread.php?t=40365

**TonyBrooks** · 01-31-2014, 08:33 AM

http://core-genomics.blogspot.co.uk/...ions-need.html

You could run all 20 of your samples across 2 lanes and get somewhere approaching 20m reads per sample. This should be more than adequate for differential expression analysis.
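As a rough check on the pooling arithmetic above, here is a minimal sketch. The lane yield of 200 million reads is an assumption back-calculated from the figures in this post (20 samples, 2 lanes, about 20M reads each); it is not stated in the thread, and actual yield depends on the instrument and chemistry.

```python
def reads_per_sample(lanes, reads_per_lane, samples):
    """Average reads per sample when samples are pooled evenly across lanes."""
    return lanes * reads_per_lane / samples


# Assumed lane yield (hypothetical, inferred from the numbers in the post).
ASSUMED_READS_PER_LANE = 200_000_000

per_sample = reads_per_sample(lanes=2,
                              reads_per_lane=ASSUMED_READS_PER_LANE,
                              samples=20)
print(f"{per_sample / 1e6:.0f}M reads per sample")  # prints "20M reads per sample"
```

Even pooling is idealized; real libraries rarely cluster equally, so individual samples will land above or below this average.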

**dbroh11** · 01-31-2014, 11:45 AM

Hey Guys,

Based on your response, Tony, and the paper that GenoMax kindly supplied, it looks like 20-30 M reads is more than sufficient for a run-of-the-mill differential gene expression analysis.

While we are most interested in differential gene expression, we still want to have a thorough representation of the transcriptome for both control and disease groups including novel transcripts. We aren't overly concerned with the ability to capture transcripts expressed at very low levels. Does 20-30 M still sound like a safe bet given these additional points?

Further, while I understand that short reads are more amenable to differential gene expression analysis than to isoform detection or mapping, I would like to optimize our short-read study design in a way that most benefits the Tuxedo suite's algorithms in probabilistically inferring which isoforms are present. This led me to choose the Illumina platform over SOLiD (100 bp reads rather than 35 bp reads), and paired-end rather than single-end reads to aid alignment. Do my rationale and this aspect of the study design sound appropriate for my goal?

I appreciate your helping a newbie

Dave

**GenoMax** · 01-31-2014, 12:56 PM

Originally posted by TonyBrooks

http://core-genomics.blogspot.co.uk/...ions-need.html

You could run all 20 of your samples across 2 lanes and get somewhere approaching 20m reads per sample. This should be more than adequate for differential expression analysis.

@Tony: Can you correct this URL? It does not seem to be pointing to a specific link.

**GenoMax** · 01-31-2014, 01:12 PM

Originally posted by dbroh11

This led me to choose the Illumina platform over SOLiD (100 bp reads rather than 35 bp reads), and paired-end rather than single-end reads to aid alignment. Do my rationale and this aspect of the study design sound appropriate for my goal?

I appreciate your helping a newbie

Dave

Sequencing more reads is not going to hurt, but the general consensus is that you do not want to go overboard (e.g. 100 million reads), since returns diminish at that depth.

There have been past discussions of the benefits of single-end versus paired-end reads, but nothing recent. Here are a couple of links to peruse.

50 bp paired end reads vs. 100 bp single end reads - SEQanswers

http://seqanswers.com/forums/showthread.php?t=13474

Paired-end or Single-end? - SEQanswers

http://seqanswers.com/forums/showthread.php?t=9116

**westerman** · 02-03-2014, 09:33 AM

Our sequencing center most often aims for 30M reads per sample for RNA-Seq projects. However, balancing the samples to get 30M each is troublesome. We handle this by doing one (partial) sequencing run that undershoots 30M and then re-clustering the samples so that the next run, combined with the first, brings the per-sample reads up to 30M. If you are going to do a 'one-shot' sequencing run, then you will have to aim for around 50M reads in order to have at least 25M reads per sample. I agree that aiming for 100M reads is overkill.
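The two strategies in this post can be sketched as simple arithmetic. This is an illustration only: the sample names and first-run counts are made up, and the worst-case fraction of 0.5 is an assumption read off the post's own numbers (aim for 50M to guarantee 25M), not a measured imbalance.

```python
def top_up_reads(first_run_counts, target):
    """Reads each sample still needs after a first run that undershoots the target."""
    return {sample: max(0, target - count)
            for sample, count in first_run_counts.items()}


def required_mean_depth(min_reads, worst_case_fraction):
    """Mean depth to aim for so the least-covered sample still reaches min_reads.

    worst_case_fraction is the assumed ratio of the worst sample's yield
    to the mean yield (hypothetical parameter, 0 < fraction <= 1).
    """
    if not 0 < worst_case_fraction <= 1:
        raise ValueError("worst_case_fraction must be in (0, 1]")
    return min_reads / worst_case_fraction


# Hypothetical first-run yields for three samples.
first_run = {"ctrl_1": 18_000_000, "ctrl_2": 27_000_000, "dis_1": 31_000_000}
print(top_up_reads(first_run, target=30_000_000))

# One-shot run: assume the worst sample gets half the mean yield.
print(required_mean_depth(min_reads=25_000_000, worst_case_fraction=0.5))
```

The top-up values indicate how to weight each library when re-pooling for the second run; samples that already exceed the target need no further reads.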

**dbroh11** · 02-03-2014, 10:02 AM

I appreciate your help with this. Do you have literature, aside from the paper GenoMax sent, that supports using far fewer than 100M reads (what ENCODE proposed)? I understand ENCODE is not the be-all and end-all and that their recommendations are several years old, but I would like to better understand the rationale and see more empirical data suggesting 50M is sufficient before committing funds to the project.

In addition, do you know of any literature showing that using 100 bp reads rather than 35 bp reads (Illumina vs SOLiD) truly benefits Cufflinks' estimation of the prevalence of different isoforms? We are most interested in differential gene expression, so I have narrowed our design down to using shorter reads, but I am still mulling over the pros and cons of these two platforms.

Many thanks

Dave

**gringer** · 02-03-2014, 12:43 PM

If you want the best isoform detection capability and have lots of money, long paired-end reads on Illumina are the best option. Note that with 250 bp reads and a 400 bp fragment length, you should be able to get 400 bp of continuous sequence for most read pairs, with overlap (for consistency checks) around 50 bp. We've found that 30M-ish reads (i.e. 10M~100M) are fine for hypothesis-generating analysis, so go ahead and multiplex if you've got more than that.

The longer the sequence, the more chance you have of catching multiple splice points in a single read. If you don't do this you have to guess at possible isoforms based on frequency counts.
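To see how read length and fragment length interact, here is a minimal sketch of the paired-end geometry. It assumes an idealized case: both mates are full length, the fragment is exactly the nominal size, and no bases are trimmed. With the nominal values from the post (250 bp reads, 400 bp fragments) this gives 100 bp of overlap; the usable overlap is typically smaller once low-quality read ends are trimmed or when fragments run longer than the nominal size.

```python
def paired_end_coverage(read_len, fragment_len):
    """Return (bases_covered, overlap, gap) for one idealized read pair.

    overlap: bases sequenced by both mates (available for consistency checks).
    gap: unsequenced bases in the middle of the fragment.
    """
    total = 2 * read_len
    overlap = max(0, total - fragment_len)
    gap = max(0, fragment_len - total)
    covered = min(fragment_len, total)
    return covered, overlap, gap


for read_len in (100, 250):
    covered, overlap, gap = paired_end_coverage(read_len, fragment_len=400)
    print(f"2x{read_len} bp on a 400 bp fragment: "
          f"{covered} bp covered, {overlap} bp overlap, {gap} bp gap")
```

When the gap is zero, the two mates can be merged into one continuous sequence, which is what lets a single fragment span several splice junctions instead of leaving the middle to be inferred.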

**AllSeq** · 02-04-2014, 08:36 AM

Originally posted by gringer

If you want the best isoform detection capability and have lots of money, long paired-end reads on illumina are the best option.

If you want the best isoform detection capability and have an INSANE amount of money, PacBio runs with a few different size selections would be the best option.

**mukeshwar** · 02-04-2014, 10:05 AM

Hi,

I am using the TruSeq RNA Sample Prep Kit v2 for a WTA library. I started with 6 ug of total RNA, followed by the Elute-Prime-Fragment step for 2 min, first-strand cDNA synthesis, and then second-strand cDNA synthesis, and got the following Qubit readings:

Elute-Prime-Fragment (RNA BR assay): 15.6 ng/ul
ds cDNA synthesis (dsDNA HS assay), before 1.8x bead purification: 0.312 ng/ul
ds cDNA synthesis (dsDNA HS assay), after 1.8x bead purification: 0.225 ng/ul

On the basis of these Qubit readings, I would like to know:

> Is this a sufficient concentration of ds cDNA, or am I losing ds cDNA? I didn't check the ds cDNA profile on an HS chip.
> Are my mRNA enrichment and its results satisfactory for cDNA conversion?
> Is the cDNA conversion complete?
> How can I check my first-strand cDNA product?

Basically, I want to know the checkpoints at each step that confirm the library preparation protocol is running correctly.

**GenoMax** · 02-04-2014, 12:09 PM

Please create a new thread since your question is unrelated to the thread you posted in.

New threads can be created by:

SeqAnswers.com --> Click "Forums" left navigation box --> Choose an appropriate forum to post question in --> "New Thread" button at top left.

You can then delete this post by choosing "Edit" --> "Go Advanced" --> Delete.


RNA-Seq Experimental Design Questions
