Seqanswers Leaderboard Ad

Collapse

Announcement

Collapse
No announcement yet.
X
 
  • Filter
  • Time
  • Show
Clear All
new posts

  • #61
    Yes, I did. Why?

    Comment


    • #62
      isn't this what they call trolling?
      /* Shawn Driscoll, Gene Expression Laboratory, Pfaff
      Salk Institute for Biological Studies, La Jolla, CA, USA */

      Comment


      • #63
        I've been following this thread with interest as I always find Simon's answers to be enlightening. I would remind you all that this is not the first time rskr has picked fights with people here or the first time he has asserted that everything everyone else does is bunk.

        Comment


        • #64
          Originally posted by dpryan View Post
          You realise how disingenuous that reply is, I hope. He already directly addressed this above.
          That is the paper he keeps citing as a justification for filtering genes prior to doing tests of significance to improve whatever results he is trying to get, and I don't think that particular paper really applies to whatever it is that he is doing since all the simulations were done with normally generated priors as if they were microarrays, and it isn't very good in the first place since it leads to somewhat absurd assumptions about filtering.

          Comment


          • #65
            Not to belabor the point, but I don't see any reason why two classes with small variance and a small but significant difference in means, should be discarded in preference for classes with large variance and a large difference in means.

            Comment


            • #66
              Originally posted by rskr View Post
              Not to belabor the point, but I don't see any reason why two classes with small variance and a small but significant difference in means, should be discarded in preference for classes with large variance and a large difference in means.
              I'm finally reading this paper now and the first question to you that popped into my head is - have you not heard of dimension reduction? What they are doing here is very similar to the reason one would apply PCA in other types of data. PCA doesn't work to reduce variables when the number of variables is greater than the number of samples so it can't be applied here but the concept is the same. That being the information is in the variance.

              So do you mean to argue against all forms of dimension reduction in all applications or only when applied specifically to differential gene expression? I have to tell you - dimension reduction is of extreme usefulness in many fields. This paper simply appears to be proposing a way to perform a type of dimension reduction followed by statistical testing on a variable by variable basis when the number of variables is >> number of samples.
              /* Shawn Driscoll, Gene Expression Laboratory, Pfaff
              Salk Institute for Biological Studies, La Jolla, CA, USA */

              Comment


              • #67
                Originally posted by sdriscoll View Post
                I'm finally reading this paper now and the first question to you that popped into my head is - have you not heard of dimension reduction? What they are doing here is very similar to the reason one would apply PCA in other types of data. PCA doesn't work to reduce variables when the number of variables is greater than the number of samples so it can't be applied here but the concept is the same. That being the information is in the variance.

                So do you mean to argue against all forms of dimension reduction in all applications or only when applied specifically to differential gene expression? I have to tell you - dimension reduction is of extreme usefulness in many fields. This paper simply appears to be proposing a way to perform a type of dimension reduction followed by statistical testing on a variable by variable basis when the number of variables is >> number of samples.
                PCA does have a similar problem, there are of course other ways to choose orthogonal basis, and for certain applications these may be valid. I am pointing out that in this case, filtering by variance introduces a powerful bias, against the very genes researchers are interested.

                Comment


                • #68
                  So what's your evidence that the genes filtered out are those that researchers are interested in? I think this warrants a case study rather than a theoretical war.
                  /* Shawn Driscoll, Gene Expression Laboratory, Pfaff
                  Salk Institute for Biological Studies, La Jolla, CA, USA */

                  Comment


                  • #69
                    Originally posted by sdriscoll View Post
                    So what's your evidence that the genes filtered out are those that researchers are interested in? I think this warrants a case study rather than a theoretical war.
                    It doesn't seem very complicated, that researchers would prefer separable genes with smaller variances, to separable genes with larger variances. It's not very hard to model for the normal case to see that given the same variance and p-value, one will overlap and one won't. I think most researchers would be more comfortable with the one that doesn't overlap for doing things like making medical decisions.

                    Comment


                    • #70
                      I do wonder why low-expressed genes tend to get the short end of the stick in filtering. They aren't irrelevant...they are obviously expressed for some reason unless we want to argue that we've got extra genes expressed doing nothing at all. One explanation I tend to use is that when their count values are low and additionally their coverage is very low across all samples it's hard to say whether what we're seeing is noise or real evidence that the gene is present but expressed very low. We can look at it as a technical limitation of the sequencing run - we just didn't get enough reads to test those genes.

                      Back to the filtering I do wonder one thing. Low count features tend to have very high coefficients of variation but very low variance values. Highly expressed genes tend to have very small coefficients of variation but very high variance values. Is this maybe why Simon says they use the means instead of the variances in their adaptation of what's outlined in the paper? I fail to see how a linear cutoff could be applied when there's such a clear non-linear trend between count level and variance.
                      /* Shawn Driscoll, Gene Expression Laboratory, Pfaff
                      Salk Institute for Biological Studies, La Jolla, CA, USA */

                      Comment

                      Latest Articles

                      Collapse

                      • seqadmin
                        Strategies for Sequencing Challenging Samples
                        by seqadmin


                        Despite advancements in sequencing platforms and related sample preparation technologies, certain sample types continue to present significant challenges that can compromise sequencing results. Pedro Echave, Senior Manager of the Global Business Segment at Revvity, explained that the success of a sequencing experiment ultimately depends on the amount and integrity of the nucleic acid template (RNA or DNA) obtained from a sample. “The better the quality of the nucleic acid isolated...
                        03-22-2024, 06:39 AM
                      • seqadmin
                        Techniques and Challenges in Conservation Genomics
                        by seqadmin



                        The field of conservation genomics centers on applying genomics technologies in support of conservation efforts and the preservation of biodiversity. This article features interviews with two researchers who showcase their innovative work and highlight the current state and future of conservation genomics.

                        Avian Conservation
                        Matthew DeSaix, a recent doctoral graduate from Kristen Ruegg’s lab at The University of Colorado, shared that most of his research...
                        03-08-2024, 10:41 AM

                      ad_right_rmr

                      Collapse

                      News

                      Collapse

                      Topics Statistics Last Post
                      Started by seqadmin, Yesterday, 06:37 PM
                      0 responses
                      12 views
                      0 likes
                      Last Post seqadmin  
                      Started by seqadmin, Yesterday, 06:07 PM
                      0 responses
                      10 views
                      0 likes
                      Last Post seqadmin  
                      Started by seqadmin, 03-22-2024, 10:03 AM
                      0 responses
                      52 views
                      0 likes
                      Last Post seqadmin  
                      Started by seqadmin, 03-21-2024, 07:32 AM
                      0 responses
                      68 views
                      0 likes
                      Last Post seqadmin  
                      Working...
                      X