The motivation of the development of DESeq and DEXSeq is being able to estimate biological variability between replicates, and take this into account to call differentially expressed genes or exons. If you donĀ“t have replicates, you do not know if the changes that you are observing are due to biological variation or due to the differences in your genotypes. In any experiment is crucial to do replicates, this is the only way to guarantee reproducibility on your differential expressed calls. For more details, you could check:
the discussion of our DEXSeq paper:
Seqanswers Leaderboard Ad
Collapse
Announcement
Collapse
No announcement yet.
X
-
Originally posted by areyes View PostHi gokhulkrishnakilaru,
The error talks by its own: "Underdetermined model; cannot estimate dispersions. Maybe replicates have not been properly specified.", you do not have replicates. Sorry that I can not help.
Alejandro
So no dexseq could work without replicates?
Is that the conclusion?
Is there a possibility to change the declaration while specifying the replicates in this section
Code:samples = data.frame(condition = c("WT", "KO"),replicate=c(1,1),row.names=c("WildType", "KnockOut"),stringsAsFactors=TRUE,check.names = FALSE)
Leave a comment:
-
Hi gokhulkrishnakilaru,
The error talks by its own: "Underdetermined model; cannot estimate dispersions. Maybe replicates have not been properly specified.", you do not have replicates. Sorry that I can not help.
Alejandro
Leave a comment:
-
Any thoughts anybody?
Sorry mods, for bumping up posts.
Urgent task. So, had to.
Leave a comment:
-
Originally posted by areyes View PostYou could, but they are also deleted automatically in the function "read.HTSeqCounts"!
I was successful in making the counts file as you suggested. I ran the script. The following are my errors. Any pointers that could be of help?
Code:ecs<- estimateSizeFactors(ecs) > ecs<- estimateDispersions(ecs) Dispersion estimation. (Progress report: one dot per 100 genes) Error in FUN(c("ENSMUSG00000000078", "ENSMUSG00000000134", "ENSMUSG00000000182", : Underdetermined model; cannot estimate dispersions. Maybe replicates have not been properly specified. In addition: Warning messages: 1: In .local(object, ...) : Exons with less than 11 counts will be discarded. For more details read the documentation, parameter minCount 2: In .local(object, ...) : Genes with more than 70 testable exons will be kicked out of the analysis. For more details read the documentation, parameter maxExon
Last edited by gokhulkrishnakilaru; 10-10-2012, 06:35 AM.
Leave a comment:
-
You could, but they are also deleted automatically in the function "read.HTSeqCounts"!
Leave a comment:
-
Originally posted by areyes View PostBy the way, where can I download the annotation files you used?
That is where I got the one that worked for me.
You can use genome.ucsc.edu and go to tables section. Choose mouse and refseq genes and then refFlat or refGene. Select format to be GTF and if you are successful in preparing the annotations file. Please upload it somewhere or I can invite you to my dropbox. So, that way I have a refseq annotation file.
Thanks for the support, my friend.
Leave a comment:
-
Originally posted by areyes View PostI see, I think the files you are using as input are causing some problems with the output of our htseq python scripts. I will check what is going on. In the meantime you can reformat your files to look more like this:
Code:FBgn0000003:001 0 FBgn0000008:001 0 FBgn0000008:002 0 FBgn0000008:003 0 FBgn0000008:004 1 FBgn0000008:005 4 FBgn0000008:006 1 FBgn0000008:007 18 FBgn0000008:008 4 FBgn0000008:009 16
Leave a comment:
-
I see, I think the files you are using as input are causing some problems with the output of our htseq python scripts. I will check what is going on. In the meantime you can reformat your files to look more like this:
Code:FBgn0000003:001 0 FBgn0000008:001 0 FBgn0000008:002 0 FBgn0000008:003 0 FBgn0000008:004 1 FBgn0000008:005 4 FBgn0000008:006 1 FBgn0000008:007 18 FBgn0000008:008 4 FBgn0000008:009 16
Leave a comment:
-
Originally posted by areyes View Postups, my bad in the gtf extensions thing...
does your files contain NAs also?error in scan(file what nmax sep dec quote skip nlines na.strings line 1 did not have 3 elements
I deleted those lines and it gives me another error sayingError in round(countData) : Non-numeric argument to mathematical function
Any pointers to these issues. This is my counts file's head
Code:"ENSMUSG00000000001" :001 1 "ENSMUSG00000000001" :002 0 "ENSMUSG00000000001" :003 0 "ENSMUSG00000000001" :004 1 "ENSMUSG00000000001" :005 0 "ENSMUSG00000000001" :006 0 "ENSMUSG00000000001" :007 0 "ENSMUSG00000000001" :008 0
Last edited by gokhulkrishnakilaru; 10-10-2012, 05:54 AM.
Leave a comment:
-
Originally posted by areyes View PostHave you checked the size of your first file? Looks like you are replacing your input file with the output file.
Could you please include a reproducible code for your R code? with the output of the sessionInput()? Also, I would not have many hopes in the results without replicates
Alejandro Reyes
Yes, I did check the size of the input file. I am changing the extension. The input file has GTF and the output has GFF as its extension.
My R code is as follows
Code:library(DEXSeq) options(digits=3) setwd("/test/dexseq/") library(DEXSeq) rm(list=ls()) annotationfile = file.path("/test/dexseq/Mus_musculus.GRCm38.68.gff") annotationfile samples = data.frame(condition = c("WT", "KO"),replicate=c(1,1),row.names=c("WildType", "KnockOut"),stringsAsFactors=TRUE,check.names = FALSE) samples fullFilenames<- list.files("/test/dexseq/",full.names=TRUE,pattern="DEXSEQ.txt") fullFilenames ecs<- read.HTSeqCounts(countfiles = fullFilenames,design = samples,flattenedfile = annotationfile) head(counts(ecs)) head(fData(ecs))
Leave a comment:
-
Have you checked the size of your first file? Looks like you are replacing your input file with the output file.
Could you please include a reproducible code for your R code? with the output of the sessionInput()? Also, I would not have many hopes in the results without replicates
Alejandro Reyes
Leave a comment:
-
DEXSEQ Prepare Annotation File and R output
Hi Folks,
I have downloaded the DEXSEQ package from Bioconductor.
When I try to make the annotation file using the dexseq_prepare_annotations.py script, a gff file is generated but is of zero KB in size.
I tried with the following files
Mus_musculus.NCBIM37.64.gtf
Mouse_UCSC_Refgene.gtf (Refgene from UCSC)
Mouse_UCSC_RefFlat.gtf (Refflat from UCSC)
The command I tried was
Code:python dexseq_prepare_annotation.py Mus_musculus.NCBIM37.64.gtf Mus_musculus.NCBIM37.64.gff
Also, the R output from running DEXSEQ using Mus_musculus.GRCm38.68.gf and one wild type and one knock out sample (no replicates) didn't give me anything. All I see is NA in all the columns. I am using the bam files from Tophat aligned to Refseq annotation.
What am I doing wrong? Any pointers are highly appreciated.Tags: None
Latest Articles
Collapse
-
by seqadmin
Non-coding RNAs (ncRNAs) do not code for proteins but play important roles in numerous cellular processes including gene silencing, developmental pathways, and more. There are numerous types including microRNA (miRNA), long ncRNA (lncRNA), circular RNA (circRNA), and more. In this article, we discuss innovative ncRNA research and explore recent technological advancements that improve the study of ncRNAs.
Nobel Prize for MicroRNA Discovery
This week,...-
Channel: Articles
10-07-2024, 08:07 AM -
-
by seqadmin
Metagenomics has improved the way researchers study microorganisms across diverse environments. Historically, studying microorganisms relied on culturing them in the lab, a method that limits the investigation of many species since most are unculturable1. Metagenomics overcomes these issues by allowing the study of microorganisms regardless of their ability to be cultured or the environments they inhabit. Over time, the field has evolved, especially with the advent...-
Channel: Articles
09-23-2024, 06:35 AM -
ad_right_rmr
Collapse
News
Collapse
Topics | Statistics | Last Post | ||
---|---|---|---|---|
Started by seqadmin, 10-02-2024, 04:51 AM
|
0 responses
104 views
0 likes
|
Last Post
by seqadmin
10-02-2024, 04:51 AM
|
||
Started by seqadmin, 10-01-2024, 07:10 AM
|
0 responses
112 views
0 likes
|
Last Post
by seqadmin
10-01-2024, 07:10 AM
|
||
Started by seqadmin, 09-30-2024, 08:33 AM
|
1 response
116 views
0 likes
|
Last Post
by EmiTom
10-07-2024, 06:46 AM
|
||
Started by seqadmin, 09-26-2024, 12:57 PM
|
0 responses
22 views
0 likes
|
Last Post
by seqadmin
09-26-2024, 12:57 PM
|
Leave a comment: