Unconfigured Ad

Collapse
X
 
  • Filter
  • Time
  • Show
Clear All
new posts
  • iceage
    Junior Member
    • Apr 2012
    • 1

    Differential gene expression analysis with bioreplicates using EdgeR/DESeq

    Hi everyone,

    I have read and search a lot about this topic but can not find any solution to my problem. May you will be able to help me.

    I am doing an intern-ship in bioinformatics for my master and I have to deal with RNA-seq data. I have 2 sets of experiments (A and B), both having 2 illumina runs of two stages (1 and 2) of a plant. A and B has not been done at the same time and the technology is a bit different, coming up with:
    runs about 30M reads for A,
    runs about 80M reads for B.

    For a given stage the log(RPKM) of the replicates are very well correlated.

    When I use EdgeR to obtain a common dispersion from the counts of each runs searching for differential expressed genes between each stage I obtain 0.86. Which seems far too big regarding the correlation of the RPKM. Moreover the number of differentially expressed genes is not consistent with our affymetrix knowledge (about 250 genes when we expected about 1000 genes).

    I first think about filtering the list of genes from the one having a count per million below 1 in all conditions. I then obtain a dispersion of 0.76 : still to high...

    I also think about getting variance stabilized data (with DESeq) to use with limma but it does not make sense if the samples are not paired, does it?

    I am wondering if I am doing something wrong here and if there are any filtration/computation that I should have done to obtain a more consistent common dispersion.

    Any idea would be really appreciate,

    François
  • Gordon Smyth
    Member
    • Apr 2011
    • 91

    #2
    A few points:

    edgeR is a Bioconductor package, so more detailed help is available on the Bioconductor mailing list than on SEQanswers.

    If you want to get your RNA-seq data into limma, the way to do this is use the voom() function of the limma package. See the limma User's Guide.

    There are any number of things that might be causing problems with your analysis, but there's no to way know from the information that you give. Your dispersion values are very high indeed. Have you used an MDS plot to look at your data?

    Comment

    Latest Articles

    Collapse

    • SEQadmin2
      Nine Things a Sample Prep Scientist Thinks About Before Sequencing
      by SEQadmin2


      I’m not a sequencing expert. I’m a purification scientist who uses NGS to evaluate workflows my group develops. With this perspective, we think about the sample first and the NGS workflow second. The sequencer is an exceptionally honest reporter, but it can only report on what you give it, so whether you get clean, interpretable data from an NGS workflow is largely determined before you begin.

      Here are nine questions we think about, in roughly the order they matter, before...
      06-18-2026, 07:11 AM
    • SEQadmin2
      From Collection to Sequencing: Why Sample Preparation and Preservation Define Sequencing Data
      by SEQadmin2


      Data variability is still an issue in sequencing technologies despite the advances in reproducibility and accuracy of these platforms. But the problem does not originate in the sequencing itself, but in the previous steps, before the sample reaches the sequencer.


      The first step is collection, followed by preservation and sample preparation for analysis. Most scientists overlook those steps, but not being careful might just be skewing the experiment’s results.
      ...
      06-02-2026, 10:05 AM

    ad_right_rmr

    Collapse

    News

    Collapse

    Topics Statistics Last Post
    Started by SEQadmin2, 06-26-2026, 11:10 AM
    0 responses
    13 views
    0 reactions
    Last Post SEQadmin2  
    Started by SEQadmin2, 06-17-2026, 06:09 AM
    0 responses
    48 views
    0 reactions
    Last Post SEQadmin2  
    Started by SEQadmin2, 06-09-2026, 11:58 AM
    0 responses
    107 views
    0 reactions
    Last Post SEQadmin2  
    Started by SEQadmin2, 06-05-2026, 10:09 AM
    0 responses
    125 views
    0 reactions
    Last Post SEQadmin2  
    Working...