Seqanswers Leaderboard Ad
Collapse
Announcement
Collapse
No announcement yet.
X
-
thanks for the information and the news about the new version. I will look at the PCA.
-
The difference in the results table is typically due to increased or decreased dispersion estimates, when including other samples.
Take a look at the PCA plot. If the two groups you are comparing, say A and B have higher within group variance than the other groups, then what might be happening is that the dispersion estimates can be lowered by including the other groups (because we estimate a single dispersion value per gene). See the DESeq2 paper for details on the dispersion estimation.
See the vignette section "Access to all calculated values", for extracting parameters. the model matrix is attr(dds, "modelMatrix")
Regarding the LFCs which are near zero but not equal to zero for a contrast of two groups with zeros within a larger analysis, this is expected, but I have also fixed this behavior in the next release (v1.8 released in one week), so that these will be zeroed out ( https://support.bioconductor.org/p/65213/#65254 )
Leave a comment:
-
Originally posted by dariober View PostIn general it is preferable to fit the model to all the samples you have and then, using appropriate contrast matrices, get the comparisons of interest. This way you get better estimates of the coefficients. So your first option is recommended.
My question was more about finding out why I have such a difference in the two tables of results with the p-values.
Is there a way to see exactly what design matrix is used for the actual DESeq2 analysis ( I mean the one with the 0 and 1)?
Leave a comment:
-
In general it is preferable to fit the model to all the samples you have and then, using appropriate contrast matrices, get the comparisons of interest. This way you get better estimates of the coefficients. So your first option is recommended.
If you search the limma vignettes and/or edgeR vignettes and/or bioconductor mailing list you should be able to find similar cases.
Leave a comment:
-
multi-factor vs. pair-wisw design
Hi everyone,
I have a data set of nine different groups, each with three samples. All these groups a are being compared against each other to find differentially regulated genes. All in all I have 14 different comparisons.
I have tested one design matrix for all samples vs. a pair-wise approach, where only the two compared samples were uploaded.
I was wondering which way make more sense, since I'm getting different results, when comparing the two ways.
this is my design matrix for all the samples:
Code:conditionT HP4_1 HP4 HP4_2 HP4 HP4_3 HP4 HP24_1 HP24 HP24_2 HP24 HP24_3 HP24 CR4w_1 CR4w4 CR4w_2 CR4w4 CR4w_3 CR4w4 CR4w24_1 CR4w24 CR4w24_2 CR4w24 CR4w24_3 CR4w24 CTRL4_1 CTRL4 CTRL4_2 CTRL4 CTRL4_3 CTRL4 CTRL24_1 CTRL24 CTRL24_2 CTRL24 CTRL24_3 CTRL24 basalCR4w_1 basalCR4w basalCR4w_2 basalCR4w basalCR4w_3 basalCR4w basalCTRL_1 basalCTRL basalCTRL_2 basalCTRL basalCTRL_3 basalCTRL basalHP_1 basalHP basalHP_2 basalHP basalHP_3 basalHP
Code:conditionT HP4_1 HP4 HP4_2 HP4 HP4_3 HP4 CTRL4_1 CTRL4 CTRL4_2 CTRL4 CTRL4_3 CTRL4
Code:conditionT HP24_1 HP24 HP24_2 HP24 HP24_3 HP24 basalHP_1 basalHP basalHP_2 basalHP basalHP_3 basalHP
Here is a sample of one of the comparisons from the full matrix design
Code:miRNA log2FoldChange padj mmu-miR-29a-3p 0.534368658 0.000259248 mmu-miR-26a-5p 0.378956528 0.000310647 mmu-miR-200a-3p 0.299780505 0.00060916 mmu-miR-29c-3p 0.433273797 0.00060916 mmu-miR-29b-3p 0.625200783 0.001034352 mmu-miR-30d-5p 0.253729371 0.00715 mmu-miR-30a-5p 0.289108972 0.00715 mmu-miR-26b-5p 0.287258966 0.009435688 mmu-miR-30a-3p 0.263099811 0.012596849 mmu-miR-200c-3p 0.480164731 0.016093411 mmu-miR-455-3p 0.734375756 0.016093411 mmu-miR-101a-3p 0.231741597 0.019381496 mmu-miR-101c 0.23216037 0.021264359 mmu-miR-30e-3p 0.276381941 0.026896293 mmu-miR-92b-3p 0.491022916 0.041665933 mmu-miR-99a-5p 0.332214684 0.049609316 mmu-miR-151-5p 0.259334395 0.08039887 mmu-miR-181c-5p -0.226533316 0.08039887 mmu-miR-127-3p -0.5404365 0.092739149 mmu-miR-182-5p 0.460856503 0.095664474 mmu-miR-30e-5p 0.212060214 0.095664474
Code:log2FoldChange padj mmu-miR-29a-3p 0.488112296 0.054110034 mmu-miR-29b-3p 0.531545957 0.080779499 mmu-miR-29c-3p 0.398383972 0.080779499 mmu-miR-451a -0.515259487 0.080779499 mmu-miR-26a-5p 0.35141262 0.086831362
On the other hand, probably also due to the differences in the size factors I am getting log2FC values also in miRNAs, which have no reads attached at all.
Code:miRNA baseMeanA baseMeanB baseMean log2FoldChange lfcSE stat pvalue padj HP24_1 HP24_2 HP24_3 CTRL24_1 CTRL24_2 CTRL24_3 mmu-miR-376b-5p 0 0 0.127918331 -0.000507796 0.046785551 -0.010853703 0.991340168 NA 0 0 0 0 0 0 mmu-miR-1968-3p 0 0 0.127682606 -0.000515398 0.046192878 -0.011157523 NA NA 0 0 0 0 0 0
The full design matrix shows better p-values, but creates possible artefacts in the data set. The pair-wise design shows less significant results.
So, which one of the matrices will show me more realistic results?
thanks,
Assa
Latest Articles
Collapse
-
by seqadmin
Non-coding RNAs (ncRNAs) do not code for proteins but play important roles in numerous cellular processes including gene silencing, developmental pathways, and more. There are numerous types including microRNA (miRNA), long ncRNA (lncRNA), circular RNA (circRNA), and more. In this article, we discuss innovative ncRNA research and explore recent technological advancements that improve the study of ncRNAs.
[Article Coming Soon!]...-
Channel: Articles
Yesterday, 08:07 AM -
-
by seqadmin
Metagenomics has improved the way researchers study microorganisms across diverse environments. Historically, studying microorganisms relied on culturing them in the lab, a method that limits the investigation of many species since most are unculturable1. Metagenomics overcomes these issues by allowing the study of microorganisms regardless of their ability to be cultured or the environments they inhabit. Over time, the field has evolved, especially with the advent...-
Channel: Articles
09-23-2024, 06:35 AM -
-
by seqadmin
During the COVID-19 pandemic, scientists observed that while some individuals experienced severe illness when infected with SARS-CoV-2, others were barely affected. These disparities left researchers and clinicians wondering what causes the wide variations in response to viral infections and what role genetics plays.
Jean-Laurent Casanova, M.D., Ph.D., Professor at Rockefeller University, is a leading expert in this crossover between genetics and infectious...-
Channel: Articles
09-09-2024, 10:59 AM -
ad_right_rmr
Collapse
News
Collapse
Topics | Statistics | Last Post | ||
---|---|---|---|---|
Started by seqadmin, 10-02-2024, 04:51 AM
|
0 responses
14 views
0 likes
|
Last Post
by seqadmin
10-02-2024, 04:51 AM
|
||
Started by seqadmin, 10-01-2024, 07:10 AM
|
0 responses
24 views
0 likes
|
Last Post
by seqadmin
10-01-2024, 07:10 AM
|
||
Started by seqadmin, 09-30-2024, 08:33 AM
|
1 response
31 views
0 likes
|
Last Post
by EmiTom
Yesterday, 06:46 AM
|
||
Started by seqadmin, 09-26-2024, 12:57 PM
|
0 responses
20 views
0 likes
|
Last Post
by seqadmin
09-26-2024, 12:57 PM
|
Leave a comment: