I have trouble figuring out whether to use FPKM values for measuring expression, or to stick with raw counts. I've heard that we might have too few biological replicates from each tissue, to be able to rely on the FPKM values. Does anyone else have experience with this issue? How do I decide which to use?
Seqanswers Leaderboard Ad
Collapse
Announcement
Collapse
No announcement yet.
X
-
You say you want to measure expression. But are you looking for differential expression between samples, or absolute expression to see what is most highly expressed?
FPKM is a measure of absolute abundance of a gene and can therefore be used to compare expression between genes.
Counts are relative. For differential expression analysis you are not looking between genes, but within them across replicates to see if a gene is more highly expressed in a condition or treatment.
If you are doing DE analysis using DESeq or edgeR for example, use counts. To look at sets of genes which may be co-expressed, for example, then FPKMs may be of interest.
-
Thank you for your quick response!
- It is indeed differential expression between different samples that we are interested in.
- We have primarily been using cuffdiff for the purposes of DE.
How relevant the issue of having only 3 biological replicates for each sample, in deciding whether to choose counts or FPKM?
Comment
-
I usually stick to the counts. Using the counts you do know exactly how many reads are mapped to a gene, which I prefer. I always normalize the counts for its library size in order to compare the counts across samples.
Cufflinks does correct for gene length, but I don't think there is a need to correct for gene length when only comparing genes between samples.
In order to get the differentially expressed genes I usually use the Voom method which is in the Limma/edgeR package. This method takes raw genecounts as an input and does normalize the data within the voom method.Last edited by iris_aurelia; 03-06-2013, 05:47 AM.
Comment
-
Not sure the number of replicates is relevant at all in using FPKM or counts.
Personally I have never been happy with cuff* analysis. It seems very conservative. I like to get something out of my DE analysis, but then someone may criticise that attitude!
I would try using count data in edgeR if you can use R. The manual is pretty helpful and there are many tutorials on line. 3 replicates per condition is ok, the issue is you won't have too much confidence in the results, unless you use cell culture with a very well defined response to treatment(?)
@iris_aurelia I agree with not needing to correct gene length: the comparison is within the gene, not between them.
Comment
-
I see. And I assume by that one can avoid the stringency issues that cuffdiff has with calculating q-values? That would be very promising. Thank you very much.
In that case I have a related question, but I don't know what the proper protocol is with asking separate questions within a single thread. Maybe I can link to it here: http://seqanswers.com/forums/showthread.php?t=28117
Comment
-
-
@Cynh @Pengchy
Cufflinks output should not be used for DeSeq/EdgeR. These use raw counts, which you can get after aligning with TopHat using HTSeqCount or similar programs.
Check this previous thread; there are a couple others referring to this issue.
Comment
Latest Articles
Collapse
-
by seqadmin
Non-coding RNAs (ncRNAs) do not code for proteins but play important roles in numerous cellular processes including gene silencing, developmental pathways, and more. There are numerous types including microRNA (miRNA), long ncRNA (lncRNA), circular RNA (circRNA), and more. In this article, we discuss innovative ncRNA research and explore recent technological advancements that improve the study of ncRNAs.
Nobel Prize for MicroRNA Discovery
This week,...-
Channel: Articles
10-07-2024, 08:07 AM -
-
by seqadmin
Metagenomics has improved the way researchers study microorganisms across diverse environments. Historically, studying microorganisms relied on culturing them in the lab, a method that limits the investigation of many species since most are unculturable1. Metagenomics overcomes these issues by allowing the study of microorganisms regardless of their ability to be cultured or the environments they inhabit. Over time, the field has evolved, especially with the advent...-
Channel: Articles
09-23-2024, 06:35 AM -
ad_right_rmr
Collapse
News
Collapse
Topics | Statistics | Last Post | ||
---|---|---|---|---|
Started by seqadmin, Today, 06:55 AM
|
0 responses
8 views
0 likes
|
Last Post
by seqadmin
Today, 06:55 AM
|
||
Started by seqadmin, 10-02-2024, 04:51 AM
|
0 responses
105 views
0 likes
|
Last Post
by seqadmin
10-02-2024, 04:51 AM
|
||
Started by seqadmin, 10-01-2024, 07:10 AM
|
0 responses
114 views
0 likes
|
Last Post
by seqadmin
10-01-2024, 07:10 AM
|
||
Started by seqadmin, 09-30-2024, 08:33 AM
|
1 response
117 views
0 likes
|
Last Post
by EmiTom
10-07-2024, 06:46 AM
|
Comment