Seqanswers Leaderboard Ad

**jparsons** · 05-09-2013, 06:02 AM

The easy route is to just follow the "method" used in the Cell paper (Revisiting Global Gene Expression Analysis, 151, Oct 2012). They do more or less exactly what your first instinct suggests, fitting a regression of the ERCC spikes and renormalizing.

When running any cufflinks/cuffdiff analysis in a sample which contains ERCCs, you don't want to keep ERCC-mapped reads in the denominator of your FPKM calculation. You could either normalize them away (by multiplying through by total reads / Total non-ERCC Reads) or you could prevent them from showing up in the first place (by mapping them separately) and then factoring in their relative ratios after the fact. I prefer the latter method, because I don't understand everything that Cufflinks does in its calculations, and I don't trust that the presence of spiked-in RNA doesn't cause one of Cufflinks' calculations to make an assumption that isn't true in my sample.

The renormalized values would still be FPKM, as you are merely correcting for the incorrect assumption that Cufflinks initially makes about your sample. You should be able to carry forward with Cuffdiff after you change the denominator to the proper value.

**Eric Fournier** · 05-10-2013, 06:30 AM

Thank you very much! The article was a very nice read.

**danwiththeplan** · 05-13-2013, 03:58 PM

Eric I'm curious, did you multiplex all your samples and run them on a single lane? So you required 30 separate spikes, for each library, which were then multiplex-tagged and combined? Or were they all on a different lane?

Also, did you use the ExFold mix to look at fold-change effects?

**Eric Fournier** · 05-14-2013, 06:35 AM

Hello Dan,

we ran our samples on five different lanes. Each lane used 6 of 8 possible multiplex tags from the Encore multiplex kit (which uses 4nt tags). This actually caused a small problem, since one of the combination of 6 tags that we used caused library complexity for the first four nucleotides to go down substantially, which was reflected as low quality values across the whole library.

The ERCC spikes were added immediatly after RNA extraction, while the multiplexing was done just prior to sending the libraries to the sequencing center.

Since we had 10 different tissues and that we were not interested in any particular pairwise comparison, we did not use th Exfold mix to assess fold-change effects. Rather, we used only mix 1 from the ERCC to have one shared standard across all libraries.

Topics	Statistics	Last Post
ASHG 2024 Highlights – Part Two by seqadmin Started by seqadmin, Today, 11:09 AM	0 responses 22 views 0 likes	Last Post by seqadmin Today, 11:09 AM
ASHG 2024 Highlights – Part One by seqadmin Started by seqadmin, Today, 06:13 AM	0 responses 19 views 0 likes	Last Post by seqadmin Today, 06:13 AM
Seq-Scope Expands Possibilities for High-Resolution Gene Expression Analysis by seqadmin Started by seqadmin, 11-01-2024, 06:09 AM	0 responses 30 views 0 likes	Last Post by seqadmin 11-01-2024, 06:09 AM
New Model Aims to Explain Polygenic Diseases by Connecting Genomic Mutations and Regulatory Networks by seqadmin Started by seqadmin, 10-30-2024, 05:31 AM	0 responses 21 views 0 likes	Last Post by seqadmin 10-30-2024, 05:31 AM

Seqanswers Leaderboard Ad

Announcement

Normalizing with ERCC spike-in

Comment

Comment

Comment

Comment

Latest Articles

ad_right_rmr

News