DESeq : without replication

younko

Member

Join Date: May 2014

Posts: 24
- Share
- Tweet
#1

DESeq : without replication

08-12-2014, 12:36 AM

Hello

I have a question regarding to the replicates for DESeq.

Here is my data design..

> head(data)

KYM_pre KYM_post GDY_pre GDY_post HKS_pre HKS_post KWK_pre KWK_post PYM_pre PYM_post SHM_pre SHM_post
A1BG 13 24 25 10 47 22 19 12 27 45 50 61
A2M 3 4 4 8 37 8 19 6 4 3 7 11
A2M-AS1 7 1 1 1 10 1 7 7 0 0 0 8
......

> meta

patient drug
KYM_pre 1 pre
KYM_post 1 post
GDY_pre 2 pre
GDY_post 2 post
HKS_pre 3 pre
HKS_post 3 post
KWK_pre 4 pre
KWK_post 4 post
PYM_pre 5 pre
PYM_post 5 post
SHM_pre 6 pre
SHM_post 6 post

=====================================
My goal is to find DEG responsing to the drug..

I tried two strategy neither of them worked..

1) Frankly, I don't have replicates.. so..
d_rare <- newCountDataSet(data, meta)
d_rare <- estimateSizeFactors(d_rare)
d_rare <- estimateDispersions(d_rare, method="blind", sharingMode="fit-only")

dh_fit1 = fitNbinomGLMs(d_rare, count ~ patient + drug)
dh_fit0 = fitNbinomGLMs(d_rare, count ~ patient)

Unfortunately, this model return NO DEG based on adjusted pvalue.

2) I assumes 6 patients having same disease are biological replicates for more solid estimation for dispersion

d_rare <- newCountDataSet(data, meta)
d_rare <- estimateSizeFactors(d_rare)
d_rare <- estimateDispersions(d_rare) # which means using default option (maximum likelihood estimation)

Again, in this second trial I got the following error
"None of your conditions is replicated. Use method='blind' to estimate across conditions, or 'pooled-CR', if you have crossed factors "

Does anyone have an idea? I have replications for patient(e.g. 1 1 2 2 3 3 44 ...etc) and drug(pre/post) too. why I am getting this message?? My suspection is that when I consider the cross design.. for example, for fixed factor(e.g. pre) I only have one patient 1, one patient 2, one patient 3... etc.. Is this the problem??

I also refer the DESeq manual example for pasilla Dataset.
> pasillaDesign
condition libType
untreated1 untreated single-end
untreated2 untreated single-end
untreated3 untreated paired-end
untreated4 untreated paired-end
treated1 treated single-end
treated2 treated paired-end
treated3 treated paired-end

In this design, they could sucessfully run. But in same way(like my model), for fixed single-end, they also only have one treated sample..

> cdsFull = newCountDataSet( pasillaCountTable, pasillaDesign )
> cdsFull = estimateDispersions( cdsFull )

Maybe, I am missing something... please help me..

Last edited by younko; 08-12-2014, 12:38 AM.
Tags: None
dpryan

Devon Ryan

Join Date: Jul 2011

Posts: 3478
- Share
- Tweet
#2

08-12-2014, 01:09 AM

Firstly, use DESeq2 (in fact, you'll see many people using designs very similar to yours with DESeq2).

Secondly, you do have replicates, you just also have a patient effect that you want to compensate for. It's been a while since I've used DESeq1, but the 'pooled-CR' method is probably what you want.
Comment

Previous template Next

Reply to Nine Things a Sample Prep Scientist Thinks About Before Sequencing

by GATTACAT

Love this - good data definitely starts from good input, and poor input can only give relatively poor data. I particularly like the mention of Nanodrop/absorbance based methods for quantification. It's such a toss up if you'll get an accurate reading or what amounts to a randomly generated number, and a lot of library/sequencing related issues can be traced back to poor quant.
- Channel: Articles
07-01-2026, 11:43 AM
Nine Things a Sample Prep Scientist Thinks About Before Sequencing

by SEQadmin2

I’m not a sequencing expert. I’m a purification scientist who uses NGS to evaluate workflows my group develops. With this perspective, we think about the sample first and the NGS workflow second. The sequencer is an exceptionally honest reporter, but it can only report on what you give it, so whether you get clean, interpretable data from an NGS workflow is largely determined before you begin.

Here are nine questions we think about, in roughly the order they matter, before...
- Channel: Articles
06-18-2026, 07:11 AM

Topics	Statistics	Last Post
High-Resolution Sequencing Exposes Hidden Toxoplasma Diversity by SEQadmin2 Started by SEQadmin2, 07-02-2026, 11:08 AM	0 responses 16 views 0 reactions	Last Post by SEQadmin2 07-02-2026, 11:08 AM
New AI Model Captures Long-Range Genomic Signals to Improve RNA Splice Site Prediction by SEQadmin2 Started by SEQadmin2, 06-30-2026, 05:37 AM	0 responses 17 views 0 reactions	Last Post by SEQadmin2 06-30-2026, 05:37 AM
Large-Scale Protein Screen Uncovers Hidden Regulators of Alternative Polyadenylation by SEQadmin2 Started by SEQadmin2, 06-26-2026, 11:10 AM	0 responses 20 views 0 reactions	Last Post by SEQadmin2 06-26-2026, 11:10 AM
Whole-Genome Sequencing Traces Faroe Islands Ancestry to a North Atlantic Founder Population by SEQadmin2 Started by SEQadmin2, 06-17-2026, 06:09 AM	0 responses 54 views 0 reactions	Last Post by SEQadmin2 06-17-2026, 06:09 AM

Unconfigured Ad

DESeq : without replication

Comment

Latest Articles

ad_right_rmr

News