Seqanswers Leaderboard Ad

**dpryan** · 06-05-2015, 05:34 AM

Most (probably all) packages that are used to find differentially expressed genes will return either a fold-change or a log2 fold-change (this is typically computed on the log2 scale). You would normally compute the fold-change between groups, rather than between samples (since who cares if two samples differ if the groups that they're part of don't).

You're second question relates to the first. Firstly one computes a p-value and then sort the significant results by fold-change, since low abundance genes/transcripts will show randomly high fold-changes. Secondly, one can compute the fold-change by incorporating a prior distribution. This is done in DESeq2, for example, where lowly expressed genes will have their fold-changes shrunken toward 0.

There is no fixed correspondence between RPKM and molecules per cell. In fact, you would be wise to not use RPKM for any statistics, use either raw or estimated counts instead.

**Jamiou** · 06-07-2015, 11:36 PM

Oh, okay. But where do you get the p-value from? That is some sort of hypothesis test, right? So if fold change is gene x in group A / group B, how do I get a p-value from that? And is it possible to get significant p-values ever for genes expressed close to zero (in the groups)?

Why is RPKM bad for statistics? I think I read that some software use RPKM (Cufflinks?). Why is raw (what do you mean by that?) or estimated counts better?

**dpryan** · 06-08-2015, 12:34 AM

Yes, the p-value is derived from a hypothesis test. Popular programs for this include DESeq2, edgeR, limma/voom, and cuffdiff. It's typically not possible to get significant results from very lowly expressed genes, since they tend to lack enough alignments to lend statistical power.

The conversion to RPKM loses all precision information, which makes it difficult to use for statistics. You can google for more.

Topics	Statistics	Last Post
Expanding the Horizons of Cellular Research with the Single Cell Atlas by seqadmin Started by seqadmin, 04-25-2024, 11:49 AM	0 responses 19 views 0 likes	Last Post by seqadmin 04-25-2024, 11:49 AM
Genetic Variants and Diabetes Risk in Childhood Cancer Survivors by seqadmin Started by seqadmin, 04-24-2024, 08:47 AM	0 responses 19 views 0 likes	Last Post by seqadmin 04-24-2024, 08:47 AM
Cancer Metastasis: A Deep Dive into Cellular Plasticity by seqadmin Started by seqadmin, 04-11-2024, 12:08 PM	0 responses 62 views 0 likes	Last Post by seqadmin 04-11-2024, 12:08 PM
Proteogenomic Profiles Offer New Clues in Prostate Cancer by seqadmin Started by seqadmin, 04-10-2024, 10:19 PM	0 responses 60 views 0 likes	Last Post by seqadmin 04-10-2024, 10:19 PM

Seqanswers Leaderboard Ad

Announcement

Basic questions about fold change calculations

Comment

Comment

Comment

Latest Articles

ad_right_rmr

News