Unconfigured Ad

Collapse
X
 
  • Filter
  • Time
  • Show
Clear All
new posts
  • id0
    Senior Member
    • Sep 2012
    • 130

    Comparing two groups from different RNA-seq datasets

    I am interested in comparing samples of type A to samples of type B. There are several RNA-seq datasets in GEO that have either type A or type B, but not both. Is it possible to take samples from two different datasets and compare them? I am guessing most of the observed differences will be between the two labs and not between the two conditions. Is that a reasonable concern? Is there a proper way to deal with that?
  • dpryan
    Devon Ryan
    • Jul 2011
    • 3478

    #2
    I'd be very hesitant to do such a comparison due to the uncontrolled batch effect. I suppose you could try to estimate what sort of batch effect exists by looking at the effect between only the B or A samples, but I'm not sure how well that would work in practice.

    Comment

    • TiborNagy
      Senior Member
      • Mar 2010
      • 329

      #3
      Well, try to download as many type A and type B datasets as you can and use it as technical repeats. A simple hierarchical clustering will show you discepancies.

      Comment

      • rskr
        Senior Member
        • Oct 2010
        • 249

        #4
        Since with sequencing, there is much more confidence in the observations, which have a built in noise check during the mapping, and it is necessary to account for the overall differences in reads anyway comparing samples from different labs should be straight forward, unlike micro arrays, where batch effects were known to dominate in some cases requiring a high number of technical replicates.

        After taking overall read depth into account, you would have to look for subtle effects like bias in GC annealing temperatures or PCR duplicates. Verifying the results in the lab may be difficult though, as with any meta-analysis.

        Comment

        • id0
          Senior Member
          • Sep 2012
          • 130

          #5
          Is it worth trying to normalize to housekeeping genes or something similar to that?

          Comment

          • rskr
            Senior Member
            • Oct 2010
            • 249

            #6
            Originally posted by id0 View Post
            Is it worth trying to normalize to housekeeping genes or something similar to that?
            I wouldn't normalize anything, but using housekeeping genes to verify your statistical model might not be a bad idea.

            Comment

            • dpryan
              Devon Ryan
              • Jul 2011
              • 3478

              #7
              There are still batch effects in RNAseq, though they're certainly less of an issue than in the microarray days. I happen to be looking at all of the publicly available mouse hippocampus RNAseq datasets at the moment and decided to create a little heatmap of the variance stabilised data, which you can find below. The datasets are color coded the same on the rows and columns to make life easier (there are 160 samples in the heatmap, so the labels are illegible). While there are obvious experimental differences in some of these datasets, there's still a lab batch-effect. Having said that, if you're interested in different organs or something like that then the difference due to that will be vastly greater than the batch-effect, so rskr's advise should hold-up quite well.

              BTW, some of the red-colored samples are technical replicates that I never bothered merging, which is why they cluster the way they do.

              Comment

              Latest Articles

              Collapse

              • GATTACAT
                Reply to Nine Things a Sample Prep Scientist Thinks About Before Sequencing
                by GATTACAT
                Love this - good data definitely starts from good input, and poor input can only give relatively poor data. I particularly like the mention of Nanodrop/absorbance based methods for quantification. It's such a toss up if you'll get an accurate reading or what amounts to a randomly generated number, and a lot of library/sequencing related issues can be traced back to poor quant.
                Yesterday, 11:43 AM
              • SEQadmin2
                Nine Things a Sample Prep Scientist Thinks About Before Sequencing
                by SEQadmin2


                I’m not a sequencing expert. I’m a purification scientist who uses NGS to evaluate workflows my group develops. With this perspective, we think about the sample first and the NGS workflow second. The sequencer is an exceptionally honest reporter, but it can only report on what you give it, so whether you get clean, interpretable data from an NGS workflow is largely determined before you begin.

                Here are nine questions we think about, in roughly the order they matter, before...
                06-18-2026, 07:11 AM
              • SEQadmin2
                From Collection to Sequencing: Why Sample Preparation and Preservation Define Sequencing Data
                by SEQadmin2


                Data variability is still an issue in sequencing technologies despite the advances in reproducibility and accuracy of these platforms. But the problem does not originate in the sequencing itself, but in the previous steps, before the sample reaches the sequencer.


                The first step is collection, followed by preservation and sample preparation for analysis. Most scientists overlook those steps, but not being careful might just be skewing the experiment’s results.
                ...
                06-02-2026, 10:05 AM

              ad_right_rmr

              Collapse

              News

              Collapse

              Topics Statistics Last Post
              Started by SEQadmin2, 06-30-2026, 05:37 AM
              0 responses
              9 views
              0 reactions
              Last Post SEQadmin2  
              Started by SEQadmin2, 06-26-2026, 11:10 AM
              0 responses
              18 views
              0 reactions
              Last Post SEQadmin2  
              Started by SEQadmin2, 06-17-2026, 06:09 AM
              0 responses
              52 views
              0 reactions
              Last Post SEQadmin2  
              Started by SEQadmin2, 06-09-2026, 11:58 AM
              0 responses
              110 views
              0 reactions
              Last Post SEQadmin2  
              Working...