Unconfigured Ad

Collapse
X
 
  • Filter
  • Time
  • Show
Clear All
new posts
  • SDPA_Pet
    Senior Member
    • Apr 2013
    • 222

    DESeq2 questions

    Hello, I have a function data sets. I want to look at which functions are over-representative. However, I am only interested in a subset of functions for example DNA synthesis.

    Can I just use the functions related to DNA synthesis (subset table) to do the DESeq2 analysis or I have to use the whole dataset and later found which over-representative functions relative to DNA synthesis?

    Thanks,
    Last edited by SDPA_Pet; 03-27-2017, 08:05 AM.
  • GenoMax
    Senior Member
    • Feb 2008
    • 7142

    #2
    Are you mixing up differential expression analysis (DESeq2) with GO term enrichment analysis (e.g. GeneSCF)?

    Comment

    • SDPA_Pet
      Senior Member
      • Apr 2013
      • 222

      #3
      Originally posted by GenoMax View Post
      Are you mixing up differential expression analysis (DESeq2) with GO term enrichment analysis (e.g. GeneSCF)?
      Well, I don't think so, because DESeq2 can give you log2 fold change of each functions and whether the change significant or not.
      Last edited by SDPA_Pet; 03-27-2017, 08:18 AM.

      Comment

      • wdecoster
        Member
        • Oct 2015
        • 97

        #4
        Originally posted by SDPA_Pet View Post
        Can I just use the functions related to DNA synthesis (subset table) to do the DESeq2 analysis or I have to use the whole dataset and later found which over-representative functions relative to DNA synthesis?
        You should use the whole dataset and later on check if the genes up and down regulated correspond to a certain function.

        Comment

        • gringer
          David Eccles (gringer)
          • May 2011
          • 845

          #5
          DESeq2 expects raw read-level count data as input. I don't expect that it would be possible to get that from a function data set.

          Comment

          • SDPA_Pet
            Senior Member
            • Apr 2013
            • 222

            #6
            Originally posted by gringer View Post
            DESeq2 expects raw read-level count data as input. I don't expect that it would be possible to get that from a function data set.
            Hey guys, I am sorry about the confusion. The dataset that I use the gene counts dataset. (The gene coding for the functional genes).

            So, should I use the subset or total dataset.

            Comment

            • GenoMax
              Senior Member
              • Feb 2008
              • 7142

              #7
              @SDPA_Pet: I think you have metagenomics data that you have somehow mapped as counts --> a function (so not a specific gene). Is that interpretation correct? That may be important for other posters to know.

              In any case cherry picking data is not the way to go with DESeq2.

              Comment

              • SDPA_Pet
                Senior Member
                • Apr 2013
                • 222

                #8
                Originally posted by GenoMax View Post
                @SDPA_Pet: I think you have metagenomics data that you have somehow mapped as counts --> a function (so not a specific gene). Is that interpretation correct? That may be important for other posters to know.

                In any case cherry picking data is not the way to go with DESeq2.
                Right? It's kind of tricky? In our field, we the conception of gene/functions are exchangeable.Anyhow, it is just counts table.

                Just curious,GenoMax? What is difference between the gene table and function table in your field (I would guess your field is biomedical field)? Do you have an example?

                Comment

                • GenoMax
                  Senior Member
                  • Feb 2008
                  • 7142

                  #9
                  DESeq2 is not operating on the identifiers but is considering the counts associated with them.

                  Code:
                  DNA_synthesis      50
                  TCA_Cycle        100
                  OR
                  Code:
                  BRCA1      50
                  TRPV1        100
                  should produce equivalent results from DESeq2.

                  Comment

                  • SDPA_Pet
                    Senior Member
                    • Apr 2013
                    • 222

                    #10
                    Right, my questions about the dataset choosing.

                    In you example, let say DNA_systhesis total have 50 counts. I can go deeper hierarchical level
                    DNA systhesis

                    sub Function 1 5
                    sub function 2 3
                    ...

                    TCA total 100
                    sub Fucition 1 5
                    sub fuction 2 3

                    ....

                    I could choose two way to do this. First, use the total datasets which is 150 functions total, and look at which sub function related to DNA synthesis is overabundant.

                    Or, I could only extract the 50 subfunctions from DNA synthesis and run DEseq to find out which one is overabundant.

                    To me, I only care about one large category. However, the sample size wouldn't be same in the two different analyses

                    Originally posted by GenoMax View Post
                    DESeq2 is not operating on the identifiers but is considering the counts associated with them.

                    Code:
                    DNA_synthesis      50
                    TCA_Cycle        100
                    OR
                    Code:
                    BRCA1      50
                    TRPV1        100
                    should produce equivalent results from DESeq2.
                    Last edited by SDPA_Pet; 03-28-2017, 07:52 AM.

                    Comment

                    • GenoMax
                      Senior Member
                      • Feb 2008
                      • 7142

                      #11
                      You may want to post this question on Bioconductor DESeq2 support site where people with the statistical chops will answer (someone on here may do that as well).

                      Comment

                      • SDPA_Pet
                        Senior Member
                        • Apr 2013
                        • 222

                        #12
                        Originally posted by GenoMax View Post
                        You may want to post this question on Bioconductor DESeq2 support site where people with the statistical chops will answer (someone on here may do that as well).
                        Can you give the link? I google it and find different sites. Not sure which one is the authentic

                        Comment

                        • GenoMax
                          Senior Member
                          • Feb 2008
                          • 7142

                          #13
                          Originally posted by SDPA_Pet View Post
                          Can you give the link? I google it and find different sites. Not sure which one is the authentic
                          Here is the bioconductor support site. Tag your question with deseq2.

                          Comment

                          • SDPA_Pet
                            Senior Member
                            • Apr 2013
                            • 222

                            #14
                            Originally posted by GenoMax View Post
                            Here is the bioconductor support site. Tag your question with deseq2.
                            Hi GenoMax,

                            Thanks. BTW, do you know is there any good forum to discuss R application. Bioconductor.org focus on molecular bioinformatics. I need some forums such as help people with plots (ggplots 2), multivariate statistics, etc.

                            Comment

                            Latest Articles

                            Collapse

                            • GATTACAT
                              Reply to Nine Things a Sample Prep Scientist Thinks About Before Sequencing
                              by GATTACAT
                              Love this - good data definitely starts from good input, and poor input can only give relatively poor data. I particularly like the mention of Nanodrop/absorbance based methods for quantification. It's such a toss up if you'll get an accurate reading or what amounts to a randomly generated number, and a lot of library/sequencing related issues can be traced back to poor quant.
                              07-01-2026, 11:43 AM
                            • SEQadmin2
                              Nine Things a Sample Prep Scientist Thinks About Before Sequencing
                              by SEQadmin2


                              I’m not a sequencing expert. I’m a purification scientist who uses NGS to evaluate workflows my group develops. With this perspective, we think about the sample first and the NGS workflow second. The sequencer is an exceptionally honest reporter, but it can only report on what you give it, so whether you get clean, interpretable data from an NGS workflow is largely determined before you begin.

                              Here are nine questions we think about, in roughly the order they matter, before...
                              06-18-2026, 07:11 AM

                            ad_right_rmr

                            Collapse

                            News

                            Collapse

                            Topics Statistics Last Post
                            Started by SEQadmin2, Yesterday, 11:08 AM
                            0 responses
                            7 views
                            0 reactions
                            Last Post SEQadmin2  
                            Started by SEQadmin2, 06-30-2026, 05:37 AM
                            0 responses
                            11 views
                            0 reactions
                            Last Post SEQadmin2  
                            Started by SEQadmin2, 06-26-2026, 11:10 AM
                            0 responses
                            19 views
                            0 reactions
                            Last Post SEQadmin2  
                            Started by SEQadmin2, 06-17-2026, 06:09 AM
                            0 responses
                            53 views
                            0 reactions
                            Last Post SEQadmin2  
                            Working...