**hiddenrisk** · 01-27-2015, 11:01 AM

Hi, as I understand the documentation, you need a sample data frame that includes replicate information. As I read your sample table at present, you have 12 different levels for the conditions (i.e. 12 rows) and only two levels for the "samples" ("ctl" and "test"). You are therefore telling DESeq that there are two conditions to test (ctl and test) rather than the 5 conditions you actually tested. My suggestion would be to use the following code to make your colData:

> samples<-c("X1", "X2", "A1", "A2", "B1", "B2", "C1", 'C2', "D1", "D2")
> condition <- c(rep("ctrl",2),rep("A",2),rep("B",2),rep("C",2),rep("D",2))
> pData <- data.frame(condition = factor(condition), row.names = samples)
> dds <- DESeqDataSetFromMatrix(countData = countsTable, colData=pData, design=~condition)
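Before calling the constructor, it can help to confirm that the sample table lines up with the count columns and that each condition really has replicates. A minimal base-R sketch, assuming the ten sample names from the post above; `toyCounts` is made-up data standing in for `countsTable`, and the sample table is built as a data frame keyed by sample name:

```r
# Sample names and conditions as suggested in the post above
samples <- c("X1", "X2", "A1", "A2", "B1", "B2", "C1", "C2", "D1", "D2")
condition <- factor(rep(c("ctrl", "A", "B", "C", "D"), each = 2),
                    levels = c("ctrl", "A", "B", "C", "D"))
colData <- data.frame(condition = condition, row.names = samples)

# Toy count matrix (made-up values) standing in for countsTable
set.seed(1)
toyCounts <- matrix(rpois(50, lambda = 20), nrow = 5,
                    dimnames = list(paste0("gene", 1:5), samples))

# Columns of the counts must line up with the rows of the sample table
stopifnot(identical(colnames(toyCounts), rownames(colData)))

# Each condition should appear more than once (replicates)
table(colData$condition)
```

Putting "ctrl" first in `levels` is a choice made here so that it acts as the reference level; the original post does not specify one.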

**Michael Love** · 01-27-2015, 11:15 AM

I can't see either where the error is coming from. Maybe some sanity checks on the matrix you provide to countData, to make sure it is an integer matrix with 10 columns and no NAs.

class(countsTable)
dim(countsTable)
apply(countsTable, 2, summary)
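The same checks can be bundled into one small helper that counts the usual offenders directly. This is only a sketch: `checkCounts` is a hypothetical name, not a DESeq2 function, and `toy` is made-up data.

```r
# Hypothetical helper: count common problems in a count table
checkCounts <- function(counts) {
  m <- as.matrix(counts)
  stopifnot(is.numeric(m))  # character or factor columns would fail here
  c(columns    = ncol(m),
    na         = sum(is.na(m)),
    negative   = sum(m < 0, na.rm = TRUE),
    fractional = sum(m != round(m), na.rm = TRUE))
}

# Toy table with one all-NA row
toy <- data.frame(s1 = c(5, 0, NA), s2 = c(3, 2, NA))
res <- checkCounts(toy)
res
```

Any non-zero entry under `na`, `negative`, or `fractional` points at something to clean up before building the data set.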

**nw328** · 01-27-2015, 12:15 PM

Hi Michael Love and hiddenrisk,

Thank you so much for your replies. Both helped fix the error: I did need to rework the comparisons, and I did have a stray NA. Thanks again!

**Michael Love** · 01-27-2015, 12:36 PM

I added an NA check to the constructor in front of the negative value check, since I only had an NA check in the validity function which is called later on.

**lmolokin** · 02-13-2015, 11:59 AM

same error

I am getting the same error as OP. What are you guys referring to when you say NAs?

I have 24 samples: 2 treatment groups, 6 subjects with 2 time points each.

My code is as follows:

Code:

designframe <- data.frame(row.names = colnames(countData),
                          tx = factor(c(rep("c",12),rep("g",12))),
                          patient = factor(c("1","1","2","2","3","3","4","4","5","5",
                                             "6","6","1","1","2","2","3","3","4","4","5","5","6","6")),
                          time = factor(c(rep(1:2,12))))

patient <- factor(designframe$patient)
tx <- factor(designframe$tx)
time <- factor(designframe$time)
designframe <- data.frame(tx,patient,time)

dds <- DESeqDataSetFromMatrix(countData, colData = designframe, formula(~patient+time+tx:time))

The DESeqDataSetFromMatrix line results in:

Error in if (any(assay(se) < 0)) { :
missing value where TRUE/FALSE needed

The "sanity checks" yield the following:

Code:

> class(countData)
[1] "data.frame"
> dim(countData)
[1] 51798    24
> apply(countData, 2, summary)
           C211BL   C211M3    C215BL   C215M3    C220BL    C220M3    C305BL    C305M3    C317BL    C317M3   C324BL
Min.          0.0      0.0       0.0      0.0       0.0      0.00      0.00       0.0       0.0       0.0      0.0
1st Qu.       0.0      0.0       0.0      0.0       0.0      0.00      0.00       0.0       0.0       0.0      0.0
Median        0.0      0.0       0.0      0.0       0.0      0.00      0.00       0.0       0.0       0.0      0.0
Mean        288.3    112.4     231.4    159.2     404.6     59.02     74.74     254.6     273.5     327.6    162.1
3rd Qu.      19.0      8.0      17.0     10.0      24.0      2.00      6.00      19.0      15.0      24.0     14.0
Max.    3128000.0 863800.0 2138000.0 746700.0 1714000.0 211400.00 494600.00 1805000.0 5676000.0 5882000.0 419700.0
NA's          1.0      1.0       1.0      1.0       1.0      1.00      1.00       1.0       1.0       1.0      1.0
          C324M3   G211BL   G211M3 G215BL   G215M3   G220BL   G220M3  G305BL  G305M3   G317BL   G317M3   G324BL
Min.         0.0      0.0      0.0      0      0.0      0.0      0.0     0.0     0.0      0.0      0.0      0.0
1st Qu.      0.0      0.0      0.0      0      0.0      0.0      0.0     0.0     0.0      0.0      0.0      0.0
Median       0.0      0.0      0.0      0      0.0      0.0      0.0     0.0     0.0      0.0      0.0      0.0
Mean       199.5    202.7    196.2    209    301.1    170.6    121.6   167.8   155.9    219.8    278.5    211.4
3rd Qu.     17.0     16.0     16.0     18     22.0     11.0      5.0    14.0    12.0     18.0     29.0     17.0
Max.    609700.0 111000.0 146500.0 107700 168800.0 135900.0 132600.0 79130.0 84860.0 104900.0 108000.0 125200.0
NA's         1.0      1.0      1.0      1      1.0      1.0      1.0     1.0     1.0      1.0      1.0      1.0
         G324M3
Min.        0.0
1st Qu.     0.0
Median      0.0
Mean      116.6
3rd Qu.    10.0
Max.    77110.0
NA's        1.0

I see NA's of 1.0 under each column but I'm not sure what that means.

Thanks!

**dpryan** · 02-13-2015, 12:11 PM

Likely unrelated, but you shouldn't have fractional counts.

The summary indicates that you have NA ("not available", i.e. missing) values somewhere. You probably just have a row of them, so:

Code:

countData[which(is.na(countData[,1])),]

will probably show the row in question. Just remove it.

**Michael Love** · 02-13-2015, 12:11 PM

That means that you have some NA's in the matrix, which should only have non-negative integers.

You can try to find them with:

Code:

narows <- apply(countData, 1, function(x) any(is.na(x)))

table(narows)

You need to remove these rows from the countData first:

Code:

countDataClean <- countData[ !narows, ]
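For reference, here is the removal step end to end on a toy table, followed by a conversion to an integer matrix. This is a sketch with made-up values; `toy` and `countMatrix` are illustrative names, and base R's `complete.cases()` is shown as an equivalent row filter.

```r
# Toy count table (made-up values) with one all-NA row
toy <- data.frame(s1 = c(10, 0, NA), s2 = c(7, 3, NA),
                  row.names = c("g1", "g2", "g3"))

# Flag rows containing any NA, then drop them
narows <- apply(toy, 1, function(x) any(is.na(x)))
clean <- toy[!narows, ]

# complete.cases() marks the rows that have no NA, i.e. the ones kept
keep <- complete.cases(toy)

# Convert the cleaned data frame to an integer matrix
countMatrix <- as.matrix(clean)
storage.mode(countMatrix) <- "integer"
dim(countMatrix)
```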

**lmolokin** · 02-13-2015, 12:22 PM

There appeared to be a row of NAs, but how is that possible? When I checked countData.csv, all counts were integers. Did the NAs somehow get introduced upon import?

Code:

countData = read.csv (file.choose(), header=TRUE, row.names=1)

?

**dpryan** · 02-13-2015, 03:28 PM

If you have a blank line at the end then that'll happen.
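One way to see this for yourself is to read small inline CSV strings. As far as I know, `read.csv` skips lines that are completely empty by default (`blank.lines.skip = TRUE`), so the usual culprit is a trailing line that still contains delimiters, which some spreadsheet exports add. A sketch with made-up data:

```r
# Made-up CSV text; the last data line holds only a delimiter
withDelims <- "s1,s2\n1,2\n3,4\n,\n"
# Same data, but the last line is completely empty
withBlank  <- "s1,s2\n1,2\n3,4\n\n"

a <- read.csv(text = withDelims, header = TRUE)
b <- read.csv(text = withBlank, header = TRUE)

nrow(a)        # counts the delimiter-only line as a row
sum(is.na(a))  # its empty fields are read as NA
nrow(b)        # the completely empty line is not counted
```

Opening the CSV in a plain text editor and looking at the last line should show which case applies.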


DESeq2 Error
