Unconfigured Ad

**dpryan** · 12-14-2015, 11:11 AM

You can simply exclude the patients for whom you only have a single sample. They'll get ignored in the analysis anyway. Anyway, yes your model is correct and you do indeed care most about the "visit:treatment" term.

**andrewelamb** · 12-15-2015, 11:57 AM

Thanks for the answer!

I did get this error however:

Error in checkFullRank(modelMatrix) :
the model matrix is not full rank, so the model cannot be fit as specified.
One or more variables or interaction terms in the design formula are linear
combinations of the others and must be removed.

my pheno file looks like:

sampleName visit condition patient
1 V2 control 1
2 V5 control 1
3 V2 treatment 2
4 V5 treatment 2
5 V5 treatment 3
6 V2 treatment 3
7 V5 treatment 4
8 V2 treatment 4
9 V2 control 5
10 V5 control 5

Removing patients from the experimental design worked. Is there any way, or value, to preserve the patient data?

**dpryan** · 12-16-2015, 12:42 AM

Indeed, I should have foreseen that :P

If you were to instead use "~patient+condition:visit+visit" and got rid of the "conditiontreatment:visitV2" column in the model matrix then the result would work. The original problem was that each condition is comprised of a set of patients, so you can't have patient coefficients and a "condition" coefficient (which is just the average of the patient coefficients!).

Sorry that that's so confusing.

**andrewelamb** · 12-16-2015, 06:30 AM

Thank you for the help!

I apologize, I'm not entirely clear on how to set up my model matrix based on your answer. It seems I would still need every column if I were to use "~patient+condition:visit+visit".

**dpryan** · 12-16-2015, 06:35 AM

I had a typo in my reply, I meant to remove the "conditiontreatment:visitV2" from the model matrix. That'll make it full rank,

**andrewelamb** · 12-16-2015, 07:31 AM

Ahh I see, I'm getting my sample table and the model matrix confused.

So is this the correct way to use my own model matrix?

design_string <- "~patient+condition:visit+visit"
sample_table <- read.table(input_file, row.names = NULL, header = T, sep = ",")
deseq_object <- DESeqDataSetFromHTSeqCount(sampleTable = sample_table,
design = ~condition, #have to have something here
directory = count_folder)
mm <- model.matrix(as.formula(design_string), sample_table)
mm <- mm[,-19] # gets rid of conditiontreatment:visitV2
deseq_object <- DESeq(deseq_object, full=mm, betaPrior=FALSE)

**dpryan** · 12-16-2015, 11:51 AM

Something along those lines at least.

Topics	Statistics	Last Post
Study Captures the First Moments of DNA Replication by SEQadmin2 Started by SEQadmin2, 07-24-2026, 12:17 PM	0 responses 31 views 0 reactions	Last Post by SEQadmin2 07-24-2026, 12:17 PM
Chemotherapy Leaves Detectable DNA Signatures in Childhood Tumors by SEQadmin2 Started by SEQadmin2, 07-23-2026, 11:41 AM	0 responses 23 views 0 reactions	Last Post by SEQadmin2 07-23-2026, 11:41 AM
Single-Cell Atlases Skew Toward European Ancestry, Analysis Finds by SEQadmin2 Started by SEQadmin2, 07-20-2026, 11:10 AM	0 responses 215 views 0 reactions	Last Post by SEQadmin2 07-20-2026, 11:10 AM
UC San Diego Bioengineers Map Gene Function in Human Stem Cells by SEQadmin2 Started by SEQadmin2, 07-13-2026, 10:26 AM	0 responses 79 views 0 reactions	Last Post by SEQadmin2 07-13-2026, 10:26 AM

Unconfigured Ad

eEtting up DESeq 2 analysis

Comment

Comment

Comment

Comment

Comment

Comment

Comment

Latest Articles

ad_right_rmr

News