Seqanswers Leaderboard Ad

**dpryan** · 06-05-2013, 07:32 AM

It's probably the filtering. Don't just blindly use the 0.3 value for DESeq or 1cpm for edgeR, you should tailor that to your particular dataset (you'll probably have more similar results then).

**sadiexiaoyu** · 06-05-2013, 08:19 AM

Originally posted by dpryan View Post

It's probably the filtering. Don't just blindly use the 0.3 value for DESeq or 1cpm for edgeR, you should tailor that to your particular dataset (you'll probably have more similar results then).

Hi,dpryan,
Thank you for your reply. I filtered my data in DESeq according to this paper (I chose FDR 0.01), http://www.bioconductor.org/packages..._filtering.pdf
But I do not know how to set same parameters in both DESeq or edgeR for filtering. Is there any code in edgeR can do the same filtering process as in DESeq, or vice versa?

Best,

Sadiexiaoyu

**dpryan** · 06-05-2013, 10:00 AM

The easiest example would be the method you used in DESeq and apply it to the edgeR case:

Code:

y <- DGEList(counts=Data, group=condition)
rs = rowSums(Data)
theta=0.3
use=(rs>quantile(rs, probs=theta))
table(use)
yFilt=y[use,]

or something along those lines (I haven't tested it, but that's the gist).

Regarding my earlier comment about blindly applying the threshold, I had assumed that you just used (for example) a theta of 0.3 since that's what the DESeq vignette used. It sounds like you followed the genefilter vignette, so just ignore what I wrote there

**Simon Anders** · 06-05-2013, 10:51 AM

Originally posted by sadiexiaoyu View Post

I filtered my data in DESeq according to this paper (I chose FDR 0.01)

Why 0.01? This is an unusual strict value. You really cannot tolerate more than one percent false positives among your hits? (I hope, BTW, you used the same cut-off for edgeR. Otherwise, a comparison would be rather pointless.)

BTW, why don't you simply use the same filter for both DESeq and edgeR (or, to keep things simpler, no filter at all)?

Simon

**sadiexiaoyu** · 06-05-2013, 11:30 AM

Originally posted by dpryan View Post

The easiest example would be the method you used in DESeq and apply it to the edgeR case:

Code:

y <- DGEList(counts=Data, group=condition)
rs = rowSums(Data)
theta=0.3
use=(rs>quantile(rs, probs=theta))
table(use)
yFilt=y[use,]

or something along those lines (I haven't tested it, but that's the gist).

Regarding my earlier comment about blindly applying the threshold, I had assumed that you just used (for example) a theta of 0.3 since that's what the DESeq vignette used. It sounds like you followed the genefilter vignette, so just ignore what I wrote there

Hi, dpryan,
Thank you for your help! I tried the other method to make edgeR and DESeq filter around same reads:
In edgeR, when you run the scripts in the following,
> colnames(y)<-targets$Label
> dim(y)
[1] 26788 6
> keep<-rowSums(cpm(y)>1)>=3
> y<-y[keep,]
> dim(y)
[1] 17613 6
you can see that you have filtered 26788-17613=9175 low count reads.
In Deseq, when you run
cds=newCountDataSet(x[,1:6],condition)
> rs=rowSums(counts(cds))
> theta=0.34
> use=(rs>quantile(rs,probs=theta))
> table(use)
use
FALSE TRUE
9148 17640
so you filtered 9148 low count reads, which is very similar with 9175 in edgeR (here I used theta=0.34).
Then I run DESeq again. But still, I get very similar results as before (just several genes are added).
And then I tried without filter method, I get less genes, and still, all the DESeq genes are belonging to edgeR genes （more than 200）result.

So maybe it is not the filtering problem?

Could it be the analysis differences between edgeR and DESeq?

And interesting thing is that DESeq genes are belonging to edgeR genes...

Best,

Sadiexiaoyu

**sadiexiaoyu** · 06-05-2013, 11:34 AM

Originally posted by Simon Anders View Post

Why 0.01? This is an unusual strict value. You really cannot tolerate more than one percent false positives among your hits? (I hope, BTW, you used the same cut-off for edgeR. Otherwise, a comparison would be rather pointless.)

BTW, why don't you simply use the same filter for both DESeq and edgeR (or, to keep things simpler, no filter at all)?

Simon

Hi, Simon,

I just replied as #6. I do not know whether my method is right to make the filter similar between edgeR and DESeq.
Besides, for FDR 0.01, maybe it is too strict...but for edgeR, I also choose genes with FDR<0.01.
I will also try FDR<0.05 later to see what is the difference between 0.05 and 0.01 in the final results.

Best,

Sadiexiaoyu

**dpryan** · 06-05-2013, 11:48 AM

Hi Sadie, if the unfiltered data produces that difference, then it must be algorithmic. I've not run into that big of a difference in my datasets, so I can't give you any ready insight regarding why that might happen. It'd be interesting to just visually look at the data (with IGV or similar) to see if the edgeR results seem correct or not.

**sadiexiaoyu** · 06-06-2013, 06:49 AM

Originally posted by dpryan View Post

Hi Sadie, if the unfiltered data produces that difference, then it must be algorithmic. I've not run into that big of a difference in my datasets, so I can't give you any ready insight regarding why that might happen. It'd be interesting to just visually look at the data (with IGV or similar) to see if the edgeR results seem correct or not.

Hi, dpryan,

Thank you for your suggestion

I think maybe the DESeq is more strict than edgeR, although I do not know exactly why. I will try your suggestion and see what happens. Thanks!

Best,

Sadiexiaoyu

Topics	Statistics	Last Post
Study Reveals How Bacteria Defend Against Viral Attacks by seqadmin Started by seqadmin, 08-27-2024, 04:40 AM	0 responses 16 views 0 likes	Last Post by seqadmin 08-27-2024, 04:40 AM
New Single-Molecule Sequencing Platform Introduces Advanced Features for High-Throughput Genomics by seqadmin Started by seqadmin, 08-22-2024, 05:00 AM	0 responses 293 views 0 likes	Last Post by seqadmin 08-22-2024, 05:00 AM
New DNA Code Discovered Revealing Complex Gene Regulation Mechanisms by seqadmin Started by seqadmin, 08-21-2024, 10:49 AM	0 responses 135 views 0 likes	Last Post by seqadmin 08-21-2024, 10:49 AM
Epigenetic Clocks Derived from Retroelements Offer New Insights into Aging by seqadmin Started by seqadmin, 08-19-2024, 05:12 AM	0 responses 124 views 0 likes	Last Post by seqadmin 08-19-2024, 05:12 AM

Seqanswers Leaderboard Ad

Announcement

different gene numbers in edgeR and DESeq

Comment

Comment

Comment

Comment

Comment

Comment

Comment

Comment

Latest Articles

ad_right_rmr

News