I am interested in comparing samples of type A to samples of type B. There are several RNA-seq datasets in GEO that have either type A or type B, but not both. Is it possible to take samples from two different datasets and compare them? I am guessing most of the observed differences will be between the two labs and not between the two conditions. Is that a reasonable concern? Is there a proper way to deal with that?
Unconfigured Ad
Collapse
X
-
I'd be very hesitant to do such a comparison due to the uncontrolled batch effect. I suppose you could try to estimate what sort of batch effect exists by looking at the effect between only the B or A samples, but I'm not sure how well that would work in practice.
-
-
Since with sequencing, there is much more confidence in the observations, which have a built in noise check during the mapping, and it is necessary to account for the overall differences in reads anyway comparing samples from different labs should be straight forward, unlike micro arrays, where batch effects were known to dominate in some cases requiring a high number of technical replicates.
After taking overall read depth into account, you would have to look for subtle effects like bias in GC annealing temperatures or PCR duplicates. Verifying the results in the lab may be difficult though, as with any meta-analysis.
Comment
-
-
There are still batch effects in RNAseq, though they're certainly less of an issue than in the microarray days. I happen to be looking at all of the publicly available mouse hippocampus RNAseq datasets at the moment and decided to create a little heatmap of the variance stabilised data, which you can find below. The datasets are color coded the same on the rows and columns to make life easier (there are 160 samples in the heatmap, so the labels are illegible). While there are obvious experimental differences in some of these datasets, there's still a lab batch-effect. Having said that, if you're interested in different organs or something like that then the difference due to that will be vastly greater than the batch-effect, so rskr's advise should hold-up quite well.
BTW, some of the red-colored samples are technical replicates that I never bothered merging, which is why they cluster the way they do.
Comment
-
Latest Articles
Collapse
-
by SEQadmin2
Data variability is still an issue in sequencing technologies despite the advances in reproducibility and accuracy of these platforms. But the problem does not originate in the sequencing itself, but in the previous steps, before the sample reaches the sequencer.
The first step is collection, followed by preservation and sample preparation for analysis. Most scientists overlook those steps, but not being careful might just be skewing the experiment’s results.
...-
Channel: Articles
06-02-2026, 10:05 AM -
-
by SEQadmin2
With the launch of new single-cell sequencing platforms in 2026, the field stands at an exciting inflection point. This article surveys the most impactful advances in the field and discusses how they’re reshaping research in cancer, immunology, and beyond.
Introduction
Single-cell sequencing technologies have undergone remarkable advances over the past decade, transitioning from low-throughput experimental approaches to highly scalable platforms capable of...-
Channel: Articles
05-22-2026, 06:42 AM -
ad_right_rmr
Collapse
News
Collapse
| Topics | Statistics | Last Post | ||
|---|---|---|---|---|
|
Started by SEQadmin2, Today, 10:09 AM
|
0 responses
9 views
0 reactions
|
Last Post
by SEQadmin2
Today, 10:09 AM
|
||
|
Started by SEQadmin2, Yesterday, 08:59 AM
|
0 responses
16 views
0 reactions
|
Last Post
by SEQadmin2
Yesterday, 08:59 AM
|
||
|
Started by SEQadmin2, 06-02-2026, 12:03 PM
|
0 responses
24 views
0 reactions
|
Last Post
by SEQadmin2
06-02-2026, 12:03 PM
|
||
|
Started by SEQadmin2, 06-02-2026, 11:40 AM
|
0 responses
20 views
0 reactions
|
Last Post
by SEQadmin2
06-02-2026, 11:40 AM
|
Comment