Seqanswers Leaderboard Ad

Collapse

Announcement

Collapse
No announcement yet.
X
 
  • Filter
  • Time
  • Show
Clear All
new posts

  • Overlap/comparison microarray and RNAseq

    Hi all,

    Not sure if this is the proper forum, but here goes. We did our first RNAseq analysis of a fungal genome. We had 2 conditions (stress and non-stressed) and 2 time points with three biological replicates of each condition and time point. One of the time points was also represented on a microarray experiment (Aglilent arrays) done with the same genome under the same conditions.

    I used edgeR to determine the DE genes. There were ~1200 genes DE on the microarray and ~2000 DE for the same time point in the RNAseq experiment. The overlap of DE genes was 590/1200. However the direction of the change was often different between them.

    My questions:
    1) If anyone done a comparison between an Agilient microarray and RNAseq, what sort of concordance was observed?

    2) The edgeR was done after RPKM normalization. How much does that affect the number of DE genes? I have the BAM files from the Bowtie alignment so I can redo the edgeR starting from non-normalized counts.

    3) Is it even worth spending time trying to determine the level of concordance because these technologies are too different to compare?

    I did see a recent paper in BMC Genomics comparing Affy exon arrays with RNAseq and they found a fairly high level of concordance.

    Thanks in advance for any thoughts, comments or advice.

    Regards,
    Maureen

  • #2
    1. Do NOT feed edgeR or DESeq with normalized data. Both tools need raw counts. Otherwise, you just get nonsense. (I guess we need to put this in flashing red letters somewhere, given the number of people who do it wrong.) So, please redo everything with raw counts. Use, e.g., htseq-count to get the raw counts.

    2. When checking for concordance, it is not very helpful to look at number of hits, because genes which are borderline significant will let the concordance look worse than it is. The better idea is to make a scatter plot of log fold change according to microarrays vs log fold change according to RNA-Seq. Plot all genes and use colour to mark significant ones.

    Comment


    • #3
      Originally posted by MDonlin View Post
      3) Is it even worth spending time trying to determine the level of concordance because these technologies are too different to compare?
      Well, technologies might be very different, but you are interested in the biological answer and that should be independent of the technology. If the two techniques give you very different results it would suggest there is something wrong with at least one of them.

      I wouldn't be to much worried about the small overlap, because these sort of experiments (few replicates, multiple test correction) give you a lot of false negative. A study that used 72 replicated suggested that most genes are regulated. The problem is that the genes that you call differentially express should be concordant on the "direction" among the different technologies...

      Comment


      • #4
        Hi Stefanoberrl,
        I totally agree with you. I believe that's a very important step to compare RNA-Seq and Microarray. Is there any work done in this regard? or any comment from anyone who has done this or know this? Thanks.

        Comment


        • #5
          Originally posted by Simon Anders View Post
          1. Do NOT feed edgeR or DESeq with normalized data. Both tools need raw counts. Otherwise, you just get nonsense. (I guess we need to put this in flashing red letters somewhere, given the number of people who do it wrong.) So, please redo everything with raw counts. Use, e.g., htseq-count to get the raw counts.

          2. When checking for concordance, it is not very helpful to look at number of hits, because genes which are borderline significant will let the concordance look worse than it is. The better idea is to make a scatter plot of log fold change according to microarrays vs log fold change according to RNA-Seq. Plot all genes and use colour to mark significant ones.
          Hi Simon,

          Already got the raw data count from RNA-seq which I used htseq-count. But, how to make a scatter plot to compare both RNA-seq and microarray data? Thank you

          Comment


          • #6
            What scatter plot are you talking about? This is a two year old thread! You can't seriously expect to get a useful answer if you post one-sentence questions without any context.

            Comment


            • #7
              Originally posted by Simon Anders View Post
              What scatter plot are you talking about? This is a two year old thread! You can't seriously expect to get a useful answer if you post one-sentence questions without any context.
              Hi Simon,

              Sorry for the silly question. Alright let straight forward. I got 2 raw data from two different platform (microarray and rna-seq). As we all know that RNA-seq data perform better than microarray which gave more sensitivity and deep coverage. So, I want to create a scatter-plot to compare between RNA-seq and Microarray data. So the data should be log2-transformed in oder to get the data more easy to read? I would say that the y-axis (rna-seq) and x-axis (microarray) where the dot scattered around the straight line shows the agreement between 2 data. I have around 500 number of gene from each data. What would you suggest to create this scatterplot? Using R? Please help, thank you!

              Comment


              • #8
                I have the same question. I have my DEG after rnaseq and microarray analysis (DESeq & Affy). I want to compare the 2 datasets in R (scatterplot, Venndiagramm). Any tips or scripts available?

                Comment

                Latest Articles

                Collapse

                • seqadmin
                  Understanding Genetic Influence on Infectious Disease
                  by seqadmin




                  During the COVID-19 pandemic, scientists observed that while some individuals experienced severe illness when infected with SARS-CoV-2, others were barely affected. These disparities left researchers and clinicians wondering what causes the wide variations in response to viral infections and what role genetics plays.

                  Jean-Laurent Casanova, M.D., Ph.D., Professor at Rockefeller University, is a leading expert in this crossover between genetics and infectious...
                  09-09-2024, 10:59 AM
                • seqadmin
                  Addressing Off-Target Effects in CRISPR Technologies
                  by seqadmin






                  The first FDA-approved CRISPR-based therapy marked the transition of therapeutic gene editing from a dream to reality1. CRISPR technologies have streamlined gene editing, and CRISPR screens have become an important approach for identifying genes involved in disease processes2. This technique introduces targeted mutations across numerous genes, enabling large-scale identification of gene functions, interactions, and pathways3. Identifying the full range...
                  08-27-2024, 04:44 AM

                ad_right_rmr

                Collapse

                News

                Collapse

                Topics Statistics Last Post
                Started by seqadmin, Today, 02:44 PM
                0 responses
                8 views
                0 likes
                Last Post seqadmin  
                Started by seqadmin, 09-06-2024, 08:02 AM
                0 responses
                143 views
                0 likes
                Last Post seqadmin  
                Started by seqadmin, 09-03-2024, 08:30 AM
                0 responses
                151 views
                0 likes
                Last Post seqadmin  
                Started by seqadmin, 08-27-2024, 04:40 AM
                0 responses
                158 views
                0 likes
                Last Post seqadmin  
                Working...
                X