Seqanswers Leaderboard Ad

**dpryan** · 06-10-2014, 04:37 AM

The standard tool is the SVA package, with the combat command.

**vkartha** · 07-02-2014, 06:46 AM

Hi - I have a question related to 'manually' adjusting for batch effects using RNASeq data (and by manually I mean not using built in batch adjustment from packages like edgeR and DESeq2, but using ComBat/gene-wise normalization/linear modelling to adjust for batch effects).

I realize there are a few options to eliminate such effects, but most methods (such as ComBat or a linear model) require normalized (normal) count data to begin with. So for instance, one would use cpm() in edgeR or DESeq to fetch normalized counts (in log space) which can then be used for batch adjustment with the corresponding batch variable from the experimental design.

My question is - upon adjusting these normalized counts for batch effect (through any method), you cannot plug those numbers back in to any differential expression package function (edgeR or DESeq) as this will result in nonsensical results. At the same time - we cannot use raw counts for the batch adjustment prior to normalizing them.

How does one solve this issue? I have a pretty strong batch effect in my data that I'm struggling to remove effectively prior to differential expression testing

Thanks

**dpryan** · 07-02-2014, 09:32 AM

In the case of SVA, you get a list containing the surrogate variables. You then just add them as covariates to your design. Combat() itself produces a tweaked expression-set, which is more useful for something like limma.

**vkartha** · 07-02-2014, 11:17 AM

Originally posted by dpryan View Post

In the case of SVA, you get a list containing the surrogate variables. You then just add them as covariates to your design. Combat() itself produces a tweaked expression-set, which is more useful for something like limma.

Thanks for your reply! I actually did try adding the batch term as a covariate to the design model specification in both edgeR and DESeq2 but I see very few DE genes (10-20 out of 20,000 tested) which is why I was looking to do it independently through ComBat or another method.

My main issue is that I might have my corrected (normalized) counts through independent batch-adjustment methods but any DE package (DESeq, edgeR or even limma's voom) would require raw counts because it does internal normalization/rescaling which would make the corresponding results not make sense anymore.

I don't see an easy way around this (Is there any package or specification where it lets you give it already normalized data without doing any transformation internally?)

Thanks any help would be greatly appreciated

**kopi-o** · 07-07-2014, 03:20 PM

Just use limma. You don't need to do voom().

Topics	Statistics	Last Post
Cancer Metastasis: A Deep Dive into Cellular Plasticity by seqadmin Started by seqadmin, 04-11-2024, 12:08 PM	0 responses 18 views 0 likes	Last Post by seqadmin 04-11-2024, 12:08 PM
Proteogenomic Profiles Offer New Clues in Prostate Cancer by seqadmin Started by seqadmin, 04-10-2024, 10:19 PM	0 responses 22 views 0 likes	Last Post by seqadmin 04-10-2024, 10:19 PM
Novel Diagnostic Assay Enhances Ovarian Cancer Detection by seqadmin Started by seqadmin, 04-10-2024, 09:21 AM	0 responses 16 views 0 likes	Last Post by seqadmin 04-10-2024, 09:21 AM
Evolutionary Dynamics of Centromeres: A Comparative Genomic Analysis by seqadmin Started by seqadmin, 04-04-2024, 09:00 AM	0 responses 47 views 0 likes	Last Post by seqadmin 04-04-2024, 09:00 AM

Seqanswers Leaderboard Ad

Announcement

Batch effect for RNAseq data

Comment

Comment

Comment

Comment

Comment

Latest Articles

ad_right_rmr

News