Unconfigured Ad

**dpryan** · 04-07-2014, 04:06 AM

It looks a lot like you don't have replicates for your conditions (at least your "conditions" variable would seem to indicate that). Since voom tries to accurately model the mean-variance relationship I wouldn't be surprised if the lack of replications causes serious problems with that. The actual error message is due to it attempting to make a graph of the mean-variance relationship (at least that's my recollection). I recall that you can disable that, though then you'd likely just run into problems downstream (you might want to have a look at the returned values in any case).

For how the design matrix is actually used, it's quite useful if you've taken linear algebra. With linear (or generalized linear) modeling, you're trying to solve the equation:

Code:

Y=B0+B1X+err

Where "Y" is the matrix of observed values, B0 is an intercept or base expression level vector, B1 is the vector containing fit coefficients, "X" is your design matrix and "err" are the residuals. For more details on that, I'd refer you to a statistics/math book or even wikipedia (the English article is OK at least).

**BioLion** · 04-07-2014, 05:00 AM

Thanks a lot for your clear answers. I get what is the design matric now.
True, I don't have replicates, it is an exploratory analysis. Then, if I understand well, the programm wasn't able to estimate a mean-variance relationship and couldn't plot a graph?
Another think: I just tried without precizing plot=TRUE and it gave me a new error:
v <- voom(y,design)
Erreor in approxfun(l, rule = 2) :
need at least two non-NA values to interpolate

However when I tried to see where were the NA-values, it did not returned any...
> sum(is.na(y$counts))
[1] 0
> sum(is.na(y$samples))
[1] 0
> sum(is.na(y$genes))
[1] 0
Could you help me to understand this error?
Thanks again, and sorry if these questions seems too basic.

**dpryan** · 04-07-2014, 05:28 AM

I suspect that stems from the lack of replicates. Since voom tries to fit a mean-variance relationship using lowess regression (at least if my memory serves), then the regression step probably uses approxfun at some point, which wouldn't work with NAs introduced by a lack replicates. I suppose you could do the equivalent of a "blind" estimation, by doing:

Code:

design2 <- model.matrix(~1)
v <- voom(y,design2)

and then using "design" instead of "design2" later on. However I wouldn't put much weight on the results. I suspect that there's no good way to do this without replicates, which should be included even in pilot experiments (the whole idea of a pilot experiment is to gauge the rough effect size, which means you need to know the background variance of at least one of the groups). It's really unfortunate that people so often attempt pilots without replicates...it's mostly a waste of money and time.

**dpryan** · 04-07-2014, 05:30 AM

I should add that before using the "blind" method I outlined, you should probably search the Bioconductor email list for mention of this sort of situation and, if not, ask Gordon Smyth or one of the other authors of voom/limma/edgeR for advice. I suspect that they'll echo my assessment of the usefulness of unreplicated experiments, but then at least you'll get an answer from the absolute most expert people on limma/voom.

**BioLion** · 04-07-2014, 06:11 AM

Thanks again for your answers. I do realize that this is not the ideal situation.
I'll try and contact the authors of voom/limma and then try to use the blind method if I don't find another accurate way of doing this.
Have a good day, and thanks again!

**Donby** · 12-22-2014, 09:03 AM

Hello, BioLion

Have you solved this problem? I met the same question with you.

Topics	Statistics	Last Post
Whole-Genome Sequencing Traces Faroe Islands Ancestry to a North Atlantic Founder Population by SEQadmin2 Started by SEQadmin2, 06-17-2026, 06:09 AM	0 responses 37 views 0 reactions	Last Post by SEQadmin2 06-17-2026, 06:09 AM
Sequencing the Two-Toed Sloth Genome Reveals Jumping Genes Tied to Its Extreme Metabolism by SEQadmin2 Started by SEQadmin2, 06-09-2026, 11:58 AM	0 responses 100 views 0 reactions	Last Post by SEQadmin2 06-09-2026, 11:58 AM
A New Method Makes Hantavirus Genome Analysis Faster and More Accessible by SEQadmin2 Started by SEQadmin2, 06-05-2026, 10:09 AM	0 responses 121 views 0 reactions	Last Post by SEQadmin2 06-05-2026, 10:09 AM
A New Single-Cell Method Maps DNA-Protein Interactions by SEQadmin2 Started by SEQadmin2, 06-04-2026, 08:59 AM	0 responses 113 views 0 reactions	Last Post by SEQadmin2 06-04-2026, 08:59 AM

Unconfigured Ad

Limma/voom for RNA-seq data

Comment

Comment

Comment

Comment

Comment

Comment

Latest Articles

ad_right_rmr

News