Unconfigured Ad

Collapse
X
 
  • Filter
  • Time
  • Show
Clear All
new posts
  • pbrand
    Member
    • Feb 2012
    • 13

    help: adjusted p-values in edgeR

    Hi everybody,

    I am relatively new to RNA-Seq and DE statistics. We have chosen to use edgeR for DE analysis and I understood most functions so far or I could answer them via searching the literature. But now I have a problem in understanding the adjustment for multiple testing.

    My questions:
    1. Do I have to adjust? Are the not-adjusted p-values useless (In case I have some significant differences in DE when not-adjusted but none after adjustment)?

    2. Which adjustment method is the best and why?

    Thanks in advance,
    philipp
  • chadn737
    Senior Member
    • Jan 2009
    • 392

    #2
    You should use the adjusted p-values. This controls for false positives that you get when making multiple comparisons.

    EdgeR uses as its default the Benjamini-Hochberg procedure. This is also the default for other programs like DESeq. Other methods are more conservative. I would just stick with Benjamini-Hochberg.
    Last edited by chadn737; 03-14-2012, 05:46 AM.

    Comment

    • pbrand
      Member
      • Feb 2012
      • 13

      #3
      Originally posted by chadn737 View Post
      You should use the adjusted p-values.
      So I can consider them as 'truly' differentially expressed?

      Can I say that I do not have DE, if all not-adjusted p-values which are significant show no significance anymore after adjustment?
      Last edited by pbrand; 03-14-2012, 05:53 AM.

      Comment

      • chadn737
        Senior Member
        • Jan 2009
        • 392

        #4
        Originally posted by pbrand View Post
        So I can consider them as 'truly' different expressed?

        Can I say that I do not have DE, if all not-adjusted p-values which are significant show no significance anymore after adjustment?
        First I should ask how many replicates you have.

        Comment

        • pbrand
          Member
          • Feb 2012
          • 13

          #5
          because of our low budget, I have just two samples with pooled RNA of 40 specimen each.

          Comment

          • chadn737
            Senior Member
            • Jan 2009
            • 392

            #6
            And you are trying to find DE between these two samples? I don't think there is much you can say about differential expression without replicates.

            Comment

            • pbrand
              Member
              • Feb 2012
              • 13

              #7
              Originally posted by chadn737 View Post
              And you are trying to find DE between these two samples? I don't think there is much you can say about differential expression without replicates.
              Unfortunately, yes. But what we can do is trying to find out possible candidate DE - genes for realtime PCR. Therefore I wanted to follow the suggested way to estimate DE via estimated dispersion.

              Hope that some money occurs if the results look promising to get replicates..

              Comment

              • chadn737
                Senior Member
                • Jan 2009
                • 392

                #8
                Originally posted by pbrand View Post
                Unfortunately, yes. But what we can do is trying to find out possible candidate DE - genes for realtime PCR. Therefore I wanted to follow the suggested way to estimate DE via estimated dispersion.

                Hope that some money occurs if the results look promising to get replicates..
                If you are using the data as an initial screen to get candidates that you will verify by qRT-PCR, then its fine to use the unadjusted p-values.

                The qRT-PCR will either confirm these results or not. But for publication of the RNA-seq, you really do need replicates and adjusted p-values.

                Comment

                • pbrand
                  Member
                  • Feb 2012
                  • 13

                  #9
                  Thanks a lot for your answers!
                  Philipp

                  Comment

                  • younko
                    Member
                    • May 2014
                    • 24

                    #10
                    Hello chadn737

                    I also have similar problems.
                    I do not have a replicate. but I have several patients having same disease.

                    For example, in my case, I have 10 patients having same disease.
                    We treated the drug and did RNA-seq for before-drug/after-drug.

                    What we want to do is to find the differentially expressed genes reponsing to the drug. Instead of replications, we have 10 patients samples.. for pre/post.

                    I used edgeR and DESeq for this.. but I could not find the any DEG with adjusted p value .. So I am thinking to look at the p value instead of adjusted pvalue by considering the fact that we can have FP... of course...

                    Would it be okay?

                    Comment

                    • mikep
                      Member
                      • Feb 2011
                      • 45

                      #11
                      Actually, what you describe are replicates. It just so happens that the biological variability is overwhelming your treatment variability.

                      Did you try a paired analysis?

                      As mentioned above, if you are going to validate candidate genes by some other method then the pvalue doesn't matter as much (though you'll waste time wading through a bunch of false positives). If you want to publish it standalone, it has to pass the correction.

                      Comment

                      • mbblack
                        Senior Member
                        • Aug 2009
                        • 245

                        #12
                        Originally posted by younko View Post
                        Hello chadn737

                        I also have similar problems.
                        I do not have a replicate. but I have several patients having same disease.

                        For example, in my case, I have 10 patients having same disease.
                        We treated the drug and did RNA-seq for before-drug/after-drug.

                        What we want to do is to find the differentially expressed genes reponsing to the drug. Instead of replications, we have 10 patients samples.. for pre/post.

                        I used edgeR and DESeq for this.. but I could not find the any DEG with adjusted p value .. So I am thinking to look at the p value instead of adjusted pvalue by considering the fact that we can have FP... of course...

                        Would it be okay?
                        As mikep said, your biological variation between patients is likely the problem, obscuring your treatment affect.

                        You could try simple pairwise, patient by patient, comparisons using some of the available non-parametric methods, like a Rank Product analysis (RANKPROD in R, for example uses permutation tests in the absence of replicates), the R tool GFOLD (which uses bayesion posterior prediction of fold change to calculate p-values and does not require replicates).
                        Michael Black, Ph.D.
                        ScitoVation LLC. RTP, N.C.

                        Comment

                        • younko
                          Member
                          • May 2014
                          • 24

                          #13
                          Thank you for your response.(mikep and mbblack)!!

                          Q1 : So, are you recommending the "Wilcoxon signed-rank test " instead of GLM model from edgeR or DESeq?? This would not consider the variation like GLM anova analysis...... Do you think is it okay?

                          Actually, I made the glm model with EdgeR.

                          design <- model.matrix(~drug)


                          If you want to publish it standalone, it has to pass the correction.
                          Q2 : As I mentioned, if I don't have any DEG with correted pvalue, then, I cannot publish the result??? Or can I validate in anyway with DEG filtered with pvalue instead of adjusted p-value for publication?

                          Comment

                          • mikep
                            Member
                            • Feb 2011
                            • 45

                            #14
                            Well I'm a fan of parametric over parametric, what I was suggesting is a better design in which you take the paired nature of the data into account

                            ... my R is crap but something like design <- model.matrix(~drug+patientID). The paired analysis may well go a long way to removing the biological variability.

                            re Q2: correct, if the pvalues are not significant after correction you cannot publish alone. if you have some kind of orthagonal validation like qPCR then you can.

                            Comment

                            • mbblack
                              Senior Member
                              • Aug 2009
                              • 245

                              #15
                              If you take a look at the GFOLD publication, for example, and the GFOLD documentation in BioConductor, it is more sophisticated than a simple Wilcoxon test. GFOLD represents one effort to deal with the (admittedly undesirable) situtation of not have any biological replicates, yet still attempt to maintain some rigor to how differentially expressed genes are selected.

                              People publish differential gene expression results based on nothing but fold change, but be prepared to fully justify why you did not include replication into your study design (e.g. sometimes, clinical studies, or any study using very difficult to obtain and/or highly limited samples are simply unable to include biological replicates). You would also be best to validate at least some of the genes, at least those critical to your final interpretation or conclusions, or you'll find you are required to anyway to get past peer review.

                              Given that there are published methods for making the most, statistically, from no or very few replicates, you would also be best to explore them, as a reviewer familiar with them will point them out to you if you do not.
                              Michael Black, Ph.D.
                              ScitoVation LLC. RTP, N.C.

                              Comment

                              Latest Articles

                              Collapse

                              ad_right_rmr

                              Collapse

                              News

                              Collapse

                              Topics Statistics Last Post
                              Started by SEQadmin2, 06-05-2026, 10:09 AM
                              0 responses
                              12 views
                              0 reactions
                              Last Post SEQadmin2  
                              Started by SEQadmin2, 06-04-2026, 08:59 AM
                              0 responses
                              24 views
                              0 reactions
                              Last Post SEQadmin2  
                              Started by SEQadmin2, 06-02-2026, 12:03 PM
                              0 responses
                              28 views
                              0 reactions
                              Last Post SEQadmin2  
                              Started by SEQadmin2, 06-02-2026, 11:40 AM
                              0 responses
                              22 views
                              0 reactions
                              Last Post SEQadmin2  
                              Working...