**Anomilie** · 06-10-2014, 11:25 PM

I'm having a similar issue where the log2fold value for many exons is also NA, while they show sufficient counts in the data. Did you find a solution/explanation for this behavior?

**areyes** · 06-17-2014, 04:22 AM

Thanks for reporting this! The current implementation of DEXSeq estimates fold changes only for those exons for which an adjusted p-value was calculated (e.g. those that are not outliers).

How many such cases, with a p-value but no fold change, do you see in your data? If there are many, could you maybe send me your object so I can take a closer look at what is happening?

Alejandro

**Anomilie** · 06-17-2014, 08:35 PM

Thanks for your response.

Out of the total 716 genes that are significant (FDR < 0.05), 101 have NA for the log2fold column, but they all have an adjusted p-value.
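A tally like this can be reproduced roughly as follows. This is only a sketch on a toy table in base R: the data frame `res` and its column names (`groupID`, `padj`, `log2fold`) are assumptions for illustration, and the fold-change column in a real DEXSeq results object may be named differently.

```r
# Toy stand-in for a results table; values and column names are made up.
res <- data.frame(
  groupID  = c("g1", "g1", "g2", "g3", "g4"),
  padj     = c(0.01, 0.20, 0.03, NA, 0.04),
  log2fold = c(1.2, NA, NA, NA, -0.8)
)

# Exons significant at FDR < 0.05 (rows with NA padj are excluded).
sig <- !is.na(res$padj) & res$padj < 0.05

# Significant exons that nevertheless have no fold change.
missingFC <- sig & is.na(res$log2fold)

# Count at the gene level, as in the numbers quoted above.
nSigGenes     <- length(unique(res$groupID[sig]))
nMissingGenes <- length(unique(res$groupID[missingFC]))
print(c(significant = nSigGenes, missingFoldChange = nMissingGenes))
```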

I am not able to attach the .Rdata object of the dxd variable created using the documentation as the file is too large. Can I send it to you in any other way or is there a more specific object that would be useful for you to debug what is going on? I generated the dxd object using the following code.

Code:

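## load the packages used below
library(GenomicFeatures)    # makeTranscriptDbFromGFF, disjointExons
library(Rsamtools)          # BamFileList
library(GenomicAlignments)  # summarizeOverlaps
library(DEXSeq)

## makeTranscriptDbFromGFF
gffFile <- makeTranscriptDbFromGFF("/Genome_files/Mus_musculus/UCSC/mm10/Annotation/Genes/genes.gtf", format="gtf")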

## preparing exonic parts
exonicParts <- disjointExons(gffFile, by="exon", aggregateGenes=FALSE)


align <- "/Samples"

## pattern is a regular expression; anchor it so index files (.bam.bai) are not picked up
files <- list.files(path=align, pattern="\\.bam$", full.names=TRUE, recursive=FALSE)

bf1 <- BamFileList(c(files),index=character(), asMates=TRUE)

genehits <- summarizeOverlaps(exonicParts, bf1, mode="IntersectionStrict", ignore.strand=FALSE, singleEnd=FALSE, inter.feature=TRUE, fragments=TRUE)

colData(genehits)$condition <- c("WT", "MT", "WT", "MT", "WT", "MT")
raw_dat <- assays(genehits)$counts

conds <- c("WT1", "MT1", "WT2", "MT2", "WT3", "MT3")
colnames(raw_dat) <- conds
## reorder columns
raw_data <- cbind(raw_dat[,c(1,3,5)], raw_dat[,c(2,4,6)]) 


geneID <- exonicParts$gene_id #-> contains gene id
g <- unlist(geneID)
exonID <- exonicParts$exonic_part # contains exon number

nam <- paste(g,exonID, sep=":")

## set the row names on the reordered matrix that is passed to DEXSeqDataSet
rownames(raw_data) <- nam


## sampleTable and design are not defined in this post
dxd <- DEXSeqDataSet(raw_data, sampleTable, design, featureID=as.character(exonID), groupID=g)

dxd <- estimateSizeFactors(dxd)
dxd <- estimateDispersions(dxd)

dxd <- testForDEU(dxd)
dxd <- estimateExonFoldChanges(dxd, fitExpToVar="condition")
dxr1 <- DEXSeqResults( dxd )

save(dxd, file="DEXseq_v1.Rdata")

**areyes** · 06-20-2014, 05:47 AM

You could try Dropbox or a similar service!

**xuer** · 07-07-2014, 05:06 AM

Is it solved?

I have the same problem. I am wondering whether it was solved and, if so, how. Could anybody reply?
Thanks!

**quinne5** · 11-17-2014, 08:27 AM

Just wondering if this has ever been solved. I'm having the same issue.

**areyes** · 11-18-2014, 06:01 AM

Sorry for the very late reply. Could you confirm if this happens when you have a moderate number of samples (around 10-12) and for genes with lots of exonic regions?

I think this has to do with the GLMs that are fitted for each gene to estimate the exon fold changes. The way this works is that a model frame is created in which the number of rows is the number of exonic regions times the number of samples. I set a threshold of 3000 on the number of rows of the model frame, so that the fit is not done if the model frame for a gene exceeds this threshold. I added an option in the latest development version so that users can increase this number, but consider that the larger this value, the longer it will take to compute. The parameter is maxRowsMF of the function estimateExonFoldChanges.

Consider that this is a temporary solution; we need to think of a smarter way to deal with this!
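To get a feel for which genes would hit this limit, the model-frame size can be approximated as exonic regions per gene times number of samples. The following is a rough sketch in base R, assuming a character vector with one gene ID per exonic region (like `g` in the code posted earlier in this thread). The helper name `genesOverLimit` and the toy data are made up for illustration, and the check inside DEXSeq may differ in detail.

```r
# Hypothetical helper: flag genes whose approximate model frame
# (exonic regions x samples) would exceed a row limit.
genesOverLimit <- function(groupID, nSamples, maxRows = 3000) {
  counts <- table(groupID)                  # exonic regions per gene
  mfRows <- as.vector(counts) * nSamples    # approximate model-frame rows
  names(counts)[mfRows > maxRows]
}

# Toy data: geneA has 300 exonic regions, geneB has 20.
groupID <- c(rep("geneA", 300), rep("geneB", 20))

over12 <- genesOverLimit(groupID, nSamples = 12)  # geneA: 300 * 12 = 3600 rows
over6  <- genesOverLimit(groupID, nSamples = 6)   # geneA: 300 * 6  = 1800 rows
print(over12)
print(over6)
```

Under this approximation, a gene needs more than 500 exonic regions to cross 3000 rows with 6 samples, but only more than 150 with 20 samples.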

Alejandro

**quinne5** · 11-18-2014, 06:52 AM

Thanks for the update. I'll give the development version a go. I have over 20 samples and the analysis is on human genes.

**wanfahmi** · 09-02-2015, 11:36 PM

Can DEXSeq handle more than 70 samples? I have different human tissues to analyse, which is not the same as the normal vs. disease comparison shown in the vignette.


dexseq results understanding
