Unconfigured Ad

Collapse
X
 
  • Filter
  • Time
  • Show
Clear All
new posts
  • agwe
    Junior Member
    • Oct 2010
    • 6

    Cuffdiff q-values

    I am having problem understanding the Cuffdiff q-values. I am using Cufdiff v.2.1.1 and I understand that in this version the lowest possible p-value one would see is 5e-05. For my analysis, with 2 conditions and 10 biological replicates each, I am using a GTF file with ~29,000 genes and ~205,000 transcripts.

    As a result I can see that e.g. 300 isoforms in the isoform_exp.diff file have a p-value of 5e-05, but because of the multiple testing correction the q-values for those isoforms are all equal to 0.0504822, so not significant... When I look at the actual values of the replicates FPKMs, they look like truly differentially expressed. It seems that with my setup, using a GTF file with so many possible transcripts, I will never arrive at anything statistically significant with Cuffdiff...

    Similarly, in case of differential gene expression, I have two such experiments, both using the same Cuffmerge generated GTF file, I see:
    1) 199 significant genes, while only 136 of them have p-value of 5e-05
    2) no significant genes, while 12 genes have p-values of 5e-05

    In both experiments I am looking at the same number of genes, shouldn't then the 12 genes from experiment 2) be marked as significant?

    Does anyone see similar behavior?
  • TiborNagy
    Senior Member
    • Mar 2010
    • 329

    #2
    If you use a statistical test many times, you would get significant results by chance. To reduce the number of false positives come from this phenomenon, multiple test correction was used. So you always get equal or smaller number of significant results after multiple test correction.

    Comment

    • agwe
      Junior Member
      • Oct 2010
      • 6

      #3
      I understand the principle behind multiple testing correction. What I don't understand is why in my results I get 12 genes with p-value of 5e-5, which is the lowest value one can see in Cufflinks, yet those are not significant after correction...

      And as the multiple testing correction q-value depends on the number of tests run, or in other words genes/transcripts tested, if I test 200,000 transcripts I will never get a significant p-value in this setup.

      Comment

      • dpryan
        Devon Ryan
        • Jul 2011
        • 3478

        #4
        The multiple testing q-value will depend on the distribution of p-values, not just their number. At least that's the case with the method that's likely being used (Benjamini-Hochberg). It's the incorporation of their distribution (in the null case, we expect p-values to be uniformly distributed, so we can use that expectation to adjust things) that allows you to have significant adjusted p-values even after 10s or 100s of thousands of tests, even with a minimum p-value of 5e-5 (though this doesn't help things).
        Last edited by dpryan; 04-01-2014, 08:22 AM.

        Comment

        Latest Articles

        Collapse

        • SEQadmin2
          Nine Things a Sample Prep Scientist Thinks About Before Sequencing
          by SEQadmin2


          I’m not a sequencing expert. I’m a purification scientist who uses NGS to evaluate workflows my group develops. With this perspective, we think about the sample first and the NGS workflow second. The sequencer is an exceptionally honest reporter, but it can only report on what you give it, so whether you get clean, interpretable data from an NGS workflow is largely determined before you begin.

          Here are nine questions we think about, in roughly the order they matter, before...
          06-18-2026, 07:11 AM
        • SEQadmin2
          From Collection to Sequencing: Why Sample Preparation and Preservation Define Sequencing Data
          by SEQadmin2


          Data variability is still an issue in sequencing technologies despite the advances in reproducibility and accuracy of these platforms. But the problem does not originate in the sequencing itself, but in the previous steps, before the sample reaches the sequencer.


          The first step is collection, followed by preservation and sample preparation for analysis. Most scientists overlook those steps, but not being careful might just be skewing the experiment’s results.
          ...
          06-02-2026, 10:05 AM

        ad_right_rmr

        Collapse

        News

        Collapse

        Topics Statistics Last Post
        Started by SEQadmin2, 06-17-2026, 06:09 AM
        0 responses
        41 views
        0 reactions
        Last Post SEQadmin2  
        Started by SEQadmin2, 06-09-2026, 11:58 AM
        0 responses
        102 views
        0 reactions
        Last Post SEQadmin2  
        Started by SEQadmin2, 06-05-2026, 10:09 AM
        0 responses
        123 views
        0 reactions
        Last Post SEQadmin2  
        Started by SEQadmin2, 06-04-2026, 08:59 AM
        0 responses
        114 views
        0 reactions
        Last Post SEQadmin2  
        Working...