DESeq newCountDataSet with seven groups

yve

Junior Member

Join Date: Mar 2012

Posts: 1
- Share
- Tweet
#1

DESeq newCountDataSet with seven groups

03-01-2012, 03:43 AM

Hello,

I have a question regarding the differential analysis of miRNAs with DESeq. I have seven different groups (all with replicates), one control group and six groups corresponding to different cells.
I want to do pairwise comparisons between the control group and each of the other groups. Therefore I did the following:

countData <- newCountDataSet(countTable, groups)
countData <- estimateSizeFactors(countData)
countData <- estimateDispersions(countData, method="pooled")

Next I did the following (group 1 being the control group):

res <- nbinomTest( countData, group1, group2 )
res <- nbinomTest( countData, group1, group3 )
res <- nbinomTest( countData, group1, group4 ) ...

Now my question is, if this is the right way to do this analysis or if I should estimate the size factors and dispersion using only the "needed data" for the specific analysis:

countData <- newCountDataSet(countTable[,c(group1, group2)], groups)
countData <- estimateSizeFactors(countData)
countData <- estimateDispersions(countData, method="pooled)

res <- nbinomTest( countData, group1, group2 )

Thanks in advance!

Yvonne
Tags: None
Simon Anders

Senior Member

Join Date: Feb 2010

Posts: 995
- Share
- Tweet
#2

03-01-2012, 09:47 AM

There are arguments in favour of either way. Estimating the dispersion from all samples has more degrees of freedom and hence yields more precise estimates, which (due to DESeq's "maximum rule") translates into better power.

On the other hand, if replicates agree badly in one group, this will drive up the dispersion estimates for the full data and hence costs power for all comparisons, unless you do everything separately.

I would use the full data, but first check (with a sample clustering after a variance-stabilizing transformation) that there are no bad samples.
Comment

Previous template Next

Essential Discoveries and Tools in Epitranscriptomics

by seqadmin

The field of epigenetics has traditionally concentrated more on DNA and how changes like methylation and phosphorylation of histones impact gene expression and regulation. However, our increased understanding of RNA modifications and their importance in cellular processes has led to a rise in epitranscriptomics research. “Epitranscriptomics brings together the concepts of epigenetics and gene expression,” explained Adrien Leger, PhD, Principal Research Scientist...
- Channel: Articles
04-22-2024, 07:01 AM
Current Approaches to Protein Sequencing

by seqadmin

Proteins are often described as the workhorses of the cell, and identifying their sequences is key to understanding their role in biological processes and disease. Currently, the most common technique used to determine protein sequences is mass spectrometry. While still a valuable tool, mass spectrometry faces several limitations and requires a highly experienced scientist familiar with the equipment to operate it. Additionally, other proteomic methods, like affinity assays, are constrained...
- Channel: Articles
04-04-2024, 04:25 PM

Topics	Statistics	Last Post
Expanding the Horizons of Cellular Research with the Single Cell Atlas by seqadmin Started by seqadmin, Yesterday, 11:49 AM	0 responses 15 views 0 likes	Last Post by seqadmin Yesterday, 11:49 AM
Genetic Variants and Diabetes Risk in Childhood Cancer Survivors by seqadmin Started by seqadmin, 04-24-2024, 08:47 AM	0 responses 16 views 0 likes	Last Post by seqadmin 04-24-2024, 08:47 AM
Cancer Metastasis: A Deep Dive into Cellular Plasticity by seqadmin Started by seqadmin, 04-11-2024, 12:08 PM	0 responses 62 views 0 likes	Last Post by seqadmin 04-11-2024, 12:08 PM
Proteogenomic Profiles Offer New Clues in Prostate Cancer by seqadmin Started by seqadmin, 04-10-2024, 10:19 PM	0 responses 60 views 0 likes	Last Post by seqadmin 04-10-2024, 10:19 PM

Seqanswers Leaderboard Ad

Announcement

DESeq newCountDataSet with seven groups

Comment

Latest Articles

ad_right_rmr

News