Unconfigured Ad

Collapse
X
 
  • Filter
  • Time
  • Show
Clear All
new posts
  • francisb
    Junior Member
    • Aug 2014
    • 2

    Classify sample based on RNAseq data and micro-array data

    I have a sample on which I will perform RNAseq. I would like to compare its transcriptome to the transcriptomes of several cell types and find the cell type it resembles most. The cell type data is, however, micro-array data.

    Does anyone have an idea how I can correlate my RNAseq data with the micro-array data? How can I include biological replicates of my RNAseq experiment, and how many replicates would I need? (I do not need any statistical significance, I just need to pick the best match)

    Thanks a lot in advance!
  • khughitt
    Junior Member
    • Dec 2012
    • 7

    #2
    Hi Francis,

    Assuming there aren't any publicly available microarray datasets for your cell type of interest, you may be able to make your RNA-Seq data look more like microarray data by log-transforming it and applying voom.

    Then, an easy place to start to look for relationships between the cell types would be to perform look at a PCA plot of all of the samples. In this case, the more closely related samples should group together in the plot.

    HTH,
    Keith

    Comment

    • mbblack
      Senior Member
      • Aug 2009
      • 245

      #3
      Since you need to compare to microarray data, why are you not performing microarray experiments with your data? If you read through the literature you will find that actual relative expression or estimated fold change does not correlate well at all between the two technologies. Nor, at least much of the time, do the two technologies even detect the same sets of significantly differentially expressed genes (the overlap in genes can often be less than 50%).

      If the intent is to try to identify the cell type by matching expression patterns, then I'd say you should at least be comparing directly similar data. Go ahead and do RNAseq with your samples if you want that data for other types of analysis, but don't try and compare RNAseq from an unknown cell type to microarray data of known cell types. I think any conclusions based on that sort of comparison will be highly criticized and far too ambiguous. Expression as measured by the two technologies is not really comparable across different cell types (or tissues or species, etc), not for pattern matching as you are proposing.

      As for how many replicates. how many were used in the microarray experiments? What was the variance in expression observed across those (i.e. how variable are these cell types when it comes to gene expression)? Are these different cell strains, cell lines, or actual wild type cell samples?
      Last edited by mbblack; 08-28-2014, 05:05 AM.
      Michael Black, Ph.D.
      ScitoVation LLC. RTP, N.C.

      Comment

      • francisb
        Junior Member
        • Aug 2014
        • 2

        #4
        Thanks for your replies!

        @mbblack Good question. The plan was to do RNAseq because we want to do other analyses as well, and we prefer RNAseq for those. If micro-array is needed to address this question, then I will consider doing that as well.

        @khughhitt Thanks for this suggestion! How should I include variability across replicates in this type of analysis?

        Comment

        • khughitt
          Junior Member
          • Dec 2012
          • 7

          #5
          When using voom, create a factor for each of your cell types, and possibly another one if you have differing batches across the samples, e.g.:

          R code
          Code:
          celltype = c("a", "a", "a", "b", "b", "b")
          model = model.matrix(~0+celltype) 
          
          # Or, if you have multiple batches, 
          # model = model.matrix(~0+celltype+batch)
          Next, you will want to perform some normalization (e.g. size factor or quantile normalization) on the counts, and call voom passing in the above model:

          Code:
          voom_result = voom(exprs(normed_counts), model, plot=TRUE)
          
          # Your transformed counts
          head(voom_result$E)
          Just to be clear, I haven't actually made this kind of comparison yet, so please take everything with a grain of salt and spend some time reading up on Voom and the differences between microarray/RNA-Seq. I have used the above approach to transform RNA-Seq data for use with Microarray methods with good success, so it seems like a reasonable place to start for your goals.
          Keith

          Good luck!

          Comment

          Latest Articles

          Collapse

          • SEQadmin2
            Nine Things a Sample Prep Scientist Thinks About Before Sequencing
            by SEQadmin2


            I’m not a sequencing expert. I’m a purification scientist who uses NGS to evaluate workflows my group develops. With this perspective, we think about the sample first and the NGS workflow second. The sequencer is an exceptionally honest reporter, but it can only report on what you give it, so whether you get clean, interpretable data from an NGS workflow is largely determined before you begin.


            Here are nine questions we think about, in roughly the order they matter, before...
            06-18-2026, 07:11 AM
          • SEQadmin2
            From Collection to Sequencing: Why Sample Preparation and Preservation Define Sequencing Data
            by SEQadmin2


            Data variability is still an issue in sequencing technologies despite the advances in reproducibility and accuracy of these platforms. But the problem does not originate in the sequencing itself, but in the previous steps, before the sample reaches the sequencer.


            The first step is collection, followed by preservation and sample preparation for analysis. Most scientists overlook those steps, but not being careful might just be skewing the experiment’s results.
            ...
            06-02-2026, 10:05 AM

          ad_right_rmr

          Collapse

          News

          Collapse

          Topics Statistics Last Post
          Started by SEQadmin2, 06-17-2026, 06:09 AM
          0 responses
          30 views
          0 reactions
          Last Post SEQadmin2  
          Started by SEQadmin2, 06-09-2026, 11:58 AM
          0 responses
          44 views
          0 reactions
          Last Post SEQadmin2  
          Started by SEQadmin2, 06-05-2026, 10:09 AM
          0 responses
          50 views
          0 reactions
          Last Post SEQadmin2  
          Started by SEQadmin2, 06-04-2026, 08:59 AM
          0 responses
          51 views
          0 reactions
          Last Post SEQadmin2  
          Working...