Seqanswers Leaderboard Ad

Collapse

Announcement

Collapse
No announcement yet.
X
 
  • Filter
  • Time
  • Show
Clear All
new posts

  • edgeR: fold change reported by exactTest for zero values of rna-seq

    I have used the exact test in edgeR to compute the log fold changes. Here is the snippet:

    Code:
    d <- DGEList(counts=counts, group=samples$Condition)
    d <- calcNormFactors(d)
    d <- estimateCommonDisp(d)
    d <- estimateTagwiseDisp(d)
    de <- exactTest(d)
    I've noticed that some genes have zero expression in all samples belonging to one of the two conditions. This would make the fold change mathematically undefined (division by zero). Yet the FC is reported as being ~2^-9. My question is - how does edgeR come up with this value? I've checked both the manual and the reference guide but couldn't figure out. There are various functions that accept pseudocounts as parameters but I have entered none in my snippet. So how does edgeR make up for the zero values in this particular case (which seems to be the default usage of the exactTest)?

  • #2
    Good numerical analysts and mathematicians do delta epsilon proofs to figure out what a mathematically undefined quantity should be in specific cases to provide continuity, then redefine the definition in a specific instance. For example a correlation between two sets with zero variance, isn't defined, division by zero, but it's pretty obvious that a value of 1.0 or perfect correlation makes the most sense when doing hierarchical clustering.

    A more general issue is that using fold changes is likely to amplify noise.

    Comment


    • #3
      Thanks, rskr.

      After closer inspection, exactTest seems to be using predFC function which, by default, adds a pseudocount of 0.125 to all observations. This seems to answer it. It would be, perhaps, more transparent to have this as a parameter in exactTest itself but once you dig in the documentation it becomes clear anyway.

      Comment

      Latest Articles

      Collapse

      • seqadmin
        Quality Control Essentials for Next-Generation Sequencing Workflows
        by seqadmin




        Like all molecular biology applications, next-generation sequencing (NGS) workflows require diligent quality control (QC) measures to ensure accurate and reproducible results. Proper QC begins at nucleic acid extraction and continues all the way through to data analysis. This article outlines the key QC steps in an NGS workflow, along with the commonly used tools and techniques.

        Nucleic Acid Quality Control
        Preparing for NGS starts with isolating the...
        02-10-2025, 01:58 PM
      • seqadmin
        An Introduction to the Technologies Transforming Precision Medicine
        by seqadmin


        In recent years, precision medicine has become a major focus for researchers and healthcare professionals. This approach offers personalized treatment and wellness plans by utilizing insights from each person's unique biology and lifestyle to deliver more effective care. Its advancement relies on innovative technologies that enable a deeper understanding of individual variability. In a joint documentary with our colleagues at Biocompare, we examined the foundational principles of precision...
        01-27-2025, 07:46 AM

      ad_right_rmr

      Collapse

      News

      Collapse

      Topics Statistics Last Post
      Started by seqadmin, 02-07-2025, 09:30 AM
      0 responses
      65 views
      0 likes
      Last Post seqadmin  
      Started by seqadmin, 02-05-2025, 10:34 AM
      0 responses
      101 views
      0 likes
      Last Post seqadmin  
      Started by seqadmin, 02-03-2025, 09:07 AM
      0 responses
      81 views
      0 likes
      Last Post seqadmin  
      Started by seqadmin, 01-31-2025, 08:31 AM
      0 responses
      45 views
      0 likes
      Last Post seqadmin  
      Working...
      X