Hi everybody,
I read a lot in the last few days about the different opinions to rna-seq normalization methods.
To be honest I'm quite a bit confused at the moment and so I would like to ask for your help to try and clarify me about how to use what kind of normalization method.
I'm sure that there is no straightforward answer for such a question but I would really appreciate contradictory opinions if it will help for other users also to explain the problem.
As far as I understand it there is no "standard" method for normalizing methods.
We have one rna-seq experiment with each only one set for control and one set for treatment. Albeit the fact of insignificance regarding the lack of replicates, I would like to understand how to work in general with rna-seq data.
we would like to look into both differential expression and differences in splice variants between the two conditions.
I have read opinion about how to normalize the data in best way for identifying differentially expressed genes and for identifying isoforms.
Apparently these two goals should be analyzed differently.
The best example for that was the discussion between Simon and lpachter about when to normalize how here: http://seqanswers.com/forums/showthr...p?t=586&page=1
I think it shows how controversy this can be. I was interested in this discussion, though it is quite an old one and a lot have changed probably.
RPKM measure the relative level of gene expression between experiments, but apparently some people are against it, due to certain biases, which it can't compensate. In the posting above, Simon mentions DESeq (EdgeR), which suppose to work better for differential expression
So my questions are:
(well I will probably have a lot more, but these are to begin with)
1. Will it be better to normalize the data twice separately for the two goals
2. Does it make sense to normalize data one time after the other?
3. Can I relay on cuffdiff/cuffcompare to give me a good estimation on the splice variants and on DESeq/DEGSeq to give me a good estimation about the differentially expressed genes?
I would appreciate every comment or discussion.
Thanks
A.
I read a lot in the last few days about the different opinions to rna-seq normalization methods.
To be honest I'm quite a bit confused at the moment and so I would like to ask for your help to try and clarify me about how to use what kind of normalization method.
I'm sure that there is no straightforward answer for such a question but I would really appreciate contradictory opinions if it will help for other users also to explain the problem.
As far as I understand it there is no "standard" method for normalizing methods.
We have one rna-seq experiment with each only one set for control and one set for treatment. Albeit the fact of insignificance regarding the lack of replicates, I would like to understand how to work in general with rna-seq data.
we would like to look into both differential expression and differences in splice variants between the two conditions.
I have read opinion about how to normalize the data in best way for identifying differentially expressed genes and for identifying isoforms.
Apparently these two goals should be analyzed differently.
The best example for that was the discussion between Simon and lpachter about when to normalize how here: http://seqanswers.com/forums/showthr...p?t=586&page=1
I think it shows how controversy this can be. I was interested in this discussion, though it is quite an old one and a lot have changed probably.
RPKM measure the relative level of gene expression between experiments, but apparently some people are against it, due to certain biases, which it can't compensate. In the posting above, Simon mentions DESeq (EdgeR), which suppose to work better for differential expression
So my questions are:
(well I will probably have a lot more, but these are to begin with)
1. Will it be better to normalize the data twice separately for the two goals
2. Does it make sense to normalize data one time after the other?
3. Can I relay on cuffdiff/cuffcompare to give me a good estimation on the splice variants and on DESeq/DEGSeq to give me a good estimation about the differentially expressed genes?
I would appreciate every comment or discussion.
Thanks
A.
Comment