Seqanswers Leaderboard Ad

**dpryan** · 10-30-2013, 02:02 AM

What are the normalized (not raw) counts for that gene? If the unadjusted p-value isn't itself below 0.05 then I'd be surprised if it ever popped up as being DE. BTW, just because a paper reports something as being significant, that doesn't mean it actually is...there are a LOT of crap analyses in the literature.

**elipsoid** · 10-30-2013, 02:13 AM

Thank you for your reply.

As I am really, really new to this, I tend to believe the paper has the right method and something is wrong with mine. But maybe they just analyzed their data in a bad way :/
I've been trying for 8 hours to obtain the same results for their "key gene" (they mention three in the paper; I assumed those three weren't obtained by misinterpretation).

Here are the normalized counts (well, I think they are; they are the result of the following):
> cdsNorm = counts(cds, normalized = TRUE)

> cdsNorm[1948,]
countN_rep1 countN_rep2 countN_rep3 countN_rep4 countN_rep5 countN_rep6
1039.511 1218.836 1043.076 1150.849 3892.303 5382.503
countH_rep1 countH_rep2 countH_rep3 countH_rep4
6668.934 90394.665 8004.270 10207.432

**bye** · 10-30-2013, 10:11 AM

I'm not a statistician either, but to call a comparison significant in a t-test, the important thing is that the difference between the group means is larger than the within-group variation. In your data, both the raw and the normalized counts show a difference between the two groups. But when you use the "pooled" estimate for dispersion, you get a larger variation for both groups, which drowns out the difference between the group means. Since you have enough replicates, you should use the "per-condition" estimate of dispersion; then the p-value will surely be significant.
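To make the "between-group difference versus within-group variation" argument concrete, here is a minimal base R sketch using the normalized counts posted above. It is only an analogy on the log2 scale: DESeq fits a negative binomial model rather than running a t-test, and the variable names are made up for illustration.

```r
# Normalized counts for the gene, copied from the post above
n <- c(1039.511, 1218.836, 1043.076, 1150.849, 3892.303, 5382.503)  # normal
h <- c(6668.934, 90394.665, 8004.270, 10207.432)                    # hypoxia

# On the log2 scale, fold changes become differences
ln <- log2(n)
lh <- log2(h)

# Within-group variances, estimated separately for each condition
var_n <- var(ln)
var_h <- var(lh)

# Pooled variance: per-group variances weighted by their degrees of freedom
df_n <- length(ln) - 1
df_h <- length(lh) - 1
var_pooled <- (df_n * var_n + df_h * var_h) / (df_n + df_h)

# Between-group difference of the mean log2 counts
mean_diff <- mean(lh) - mean(ln)

print(c(var_n = var_n, var_h = var_h, var_pooled = var_pooled,
        mean_diff = mean_diff))
```

The pooled value always lies between the two per-group values, so the less variable group is assigned more variance than it shows on its own.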

**dpryan** · 10-30-2013, 10:48 AM

As bye suggested, using per-condition dispersion might fix this. There's also the issue of how or whether they shared information across genes (see the "sharingMode" option in estimateDispersions). The default is "maximum", which is a good idea unless you have a lot of samples (more than are here). I suspect that using "per-condition" for the dispersion estimation will resolve this without monkeying around with the sharingMode.
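A rough sketch of what that call sequence might look like with the original DESeq package (not DESeq2). The count table below is simulated toy data, and `countTable`, `conds` and `geneMeans` are hypothetical names; only `estimateDispersions`, its `method` and `sharingMode` options, and `cds` come from this thread. It assumes the DESeq package is installed and that the default dispersion fit converges on the data.

```r
library(DESeq)  # the original DESeq package, assumed to be installed

set.seed(1)
# Hypothetical toy data: 1000 genes, 6 "N" samples and 4 "H" samples
sampleNames <- c(paste0("N", 1:6), paste0("H", 1:4))
geneMeans <- 2^runif(1000, min = 4, max = 12)  # one mean per gene
countTable <- matrix(rnbinom(1000 * 10, mu = geneMeans, size = 5),
                     nrow = 1000,
                     dimnames = list(paste0("gene", 1:1000), sampleNames))
conds <- factor(c(rep("N", 6), rep("H", 4)))

cds <- newCountDataSet(countTable, conds)
cds <- estimateSizeFactors(cds)

# Estimate dispersions separately for each condition, keeping the default
# sharingMode ("maximum") as suggested above
cds <- estimateDispersions(cds, method = "per-condition",
                           sharingMode = "maximum")

# Test N against H and list the genes with the smallest adjusted p-values
res <- nbinomTest(cds, "N", "H")
head(res[order(res$padj), ])
```

With real data, `countTable` would be the raw (not normalized) count matrix and `conds` the matching condition labels.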

**Dario1984** · 10-30-2013, 06:00 PM

Can you make an MDS plot of your samples? Which samples are samples 5 and 6 closest to? For this gene, samples 5 and 6 look more like the hypoxia samples. You should check whether that is also the case when using hundreds of genes.
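One possible way to draw such a plot with base R only, sketched on simulated data. The matrix `normCounts` is a hypothetical stand-in for a genes-by-samples table of normalized counts (such as `cdsNorm` above); classical MDS via `cmdscale` on Euclidean distances of log2 counts is just one reasonable choice among several.

```r
set.seed(1)
# Hypothetical stand-in for the normalized count matrix (genes x samples)
sampleNames <- c(paste0("N", 1:6), paste0("H", 1:4))
normCounts <- matrix(rnbinom(500 * 10, mu = 200, size = 2), nrow = 500,
                     dimnames = list(NULL, sampleNames))

# Log-transform so that a few highly expressed genes do not dominate
logCounts <- log2(normCounts + 1)

# Euclidean distances between samples (dist() works on rows, hence t())
d <- dist(t(logCounts))

# Classical multidimensional scaling into two dimensions
mds <- cmdscale(d, k = 2)

# Label each point with its sample name, colored by condition
plot(mds, type = "n", xlab = "MDS 1", ylab = "MDS 2")
text(mds, labels = rownames(mds),
     col = ifelse(grepl("^N", rownames(mds)), "blue", "red"))
```

If samples 5 and 6 of the normal group land next to the hypoxia samples in such a plot, that would support the suspicion raised above.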

**Simon Anders** · 10-31-2013, 01:05 AM

The source of the issue is sample H2, which is 10-fold above all the other samples. This makes DESeq conclude that this gene is extremely variable from sample to sample and hence should not be considered significant.

In other words: Why should we believe that the 3-fold change between the averages of the normal and the hypoxia samples has been caused by the hypoxic treatment, if the data shows us that one can see changes of up to 10-fold even between samples which were not treated differently?
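The ratios behind this argument can be checked directly from the normalized counts posted earlier in the thread. This is a small base R sketch with made-up variable names, and the "with and without H2" split is my own way of laying out the numbers.

```r
# Normalized counts for the gene, copied from the earlier post
n <- c(1039.511, 1218.836, 1043.076, 1150.849, 3892.303, 5382.503)  # normal
h <- c(6668.934, 90394.665, 8004.270, 10207.432)                    # hypoxia

h_others <- h[-2]  # hypoxia samples without H2

# How far H2 sits above the other hypoxia samples
h2_vs_other_h <- h[2] / mean(h_others)

# Hypoxia versus normal, with and without H2
fc_without_h2 <- mean(h_others) / mean(n)
fc_with_h2 <- mean(h) / mean(n)

print(c(h2_vs_other_h = h2_vs_other_h,
        fc_without_h2 = fc_without_h2,
        fc_with_h2 = fc_with_h2))
```

The spread within the hypoxia group is larger than the treatment effect seen once H2 is set aside, which is the point being made above.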

**elipsoid** · 10-31-2013, 08:18 PM

Hi, sorry for my late answers. I do not believe sample H2 to be wrong, but I believe their H2 experiment wasn't a great success in terms of reproducibility (see the PCA graph). However, I strongly believe samples N5 and N6 are biased by a batch effect. I chose to discard them from my analyses.
I will attach PDFs showing some graphs I generated. They show that samples N5 and N6 succumbed to something like a batch effect. I would gladly done

pdf1 : with all conditions included : diffAnalysis-AllConditions.pdf
pdf2 : with N5 and N6 condition removed : diffAnalysis.pdf
pdf3 : with H2 condition removed : diffAnalysis-H.pdf
pdf4 : with N5, N6 and H2 conditions removed : diffAnalysis-HNN.pdf

In order, each PDF contains:
1) plotDispEsts(cds, name= "H")
2) plotDispEsts(cds, name= "N")
3) plotMA(res)
4&5) The next two heatmaps were generated with method="blind" in estimateDispersions
6) plotPCA(vsdFull, intgroup=c("condition"))
7) Venn diagram of my results, filtered on only two parameters: p-value < 0.1% for Chip and adjusted p-value < 0.1% for RNAseq

After using "per-condition" parameter, pvalues got better (and it helped a lot, thanks guys).

I don't really know how to make an MDS plot; I will have to look into it if it is necessary.
I must admit this dataset is puzzling me.
Is it a common thing to get that kind of data ?

Thanks again for all your answers !

RNAseq analysis by DESeq : can't find a gene previously published as important