Seqanswers Leaderboard Ad

Collapse
X
 
  • Filter
  • Time
  • Show
Clear All
new posts
  • cnyh
    Member
    • Feb 2013
    • 39

    FPKM or counts?

    I have trouble figuring out whether to use FPKM values for measuring expression, or to stick with raw counts. I've heard that we might have too few biological replicates from each tissue, to be able to rely on the FPKM values. Does anyone else have experience with this issue? How do I decide which to use?
  • bruce01
    Senior Member
    • Mar 2011
    • 160

    #2
    You say you want to measure expression. But are you looking for differential expression between samples, or absolute expression to see what is most highly expressed?

    FPKM is a measure of absolute abundance of a gene and can therefore be used to compare expression between genes.

    Counts are relative. For differential expression analysis you are not looking between genes, but within them across replicates to see if a gene is more highly expressed in a condition or treatment.

    If you are doing DE analysis using DESeq or edgeR for example, use counts. To look at sets of genes which may be co-expressed, for example, then FPKMs may be of interest.

    Comment

    • cnyh
      Member
      • Feb 2013
      • 39

      #3
      Thank you for your quick response!
      - It is indeed differential expression between different samples that we are interested in.
      - We have primarily been using cuffdiff for the purposes of DE.

      How relevant the issue of having only 3 biological replicates for each sample, in deciding whether to choose counts or FPKM?

      Comment

      • iris_aurelia
        Member
        • Jul 2012
        • 22

        #4
        I usually stick to the counts. Using the counts you do know exactly how many reads are mapped to a gene, which I prefer. I always normalize the counts for its library size in order to compare the counts across samples.
        Cufflinks does correct for gene length, but I don't think there is a need to correct for gene length when only comparing genes between samples.

        In order to get the differentially expressed genes I usually use the Voom method which is in the Limma/edgeR package. This method takes raw genecounts as an input and does normalize the data within the voom method.
        Last edited by iris_aurelia; 03-06-2013, 05:47 AM.

        Comment

        • bruce01
          Senior Member
          • Mar 2011
          • 160

          #5
          Not sure the number of replicates is relevant at all in using FPKM or counts.

          Personally I have never been happy with cuff* analysis. It seems very conservative. I like to get something out of my DE analysis, but then someone may criticise that attitude!

          I would try using count data in edgeR if you can use R. The manual is pretty helpful and there are many tutorials on line. 3 replicates per condition is ok, the issue is you won't have too much confidence in the results, unless you use cell culture with a very well defined response to treatment(?)

          @iris_aurelia I agree with not needing to correct gene length: the comparison is within the gene, not between them.

          Comment

          • cnyh
            Member
            • Feb 2013
            • 39

            #6
            I see. And I assume by that one can avoid the stringency issues that cuffdiff has with calculating q-values? That would be very promising. Thank you very much.

            In that case I have a related question, but I don't know what the proper protocol is with asking separate questions within a single thread. Maybe I can link to it here: http://seqanswers.com/forums/showthread.php?t=28117

            Comment

            • pengchy
              Senior Member
              • Feb 2009
              • 116

              #7
              The question still exists: what's the difference between Cuffdiff counts and HTseq counts? And why the counts of cuffdiff have decimal?
              Last edited by pengchy; 04-23-2013, 11:29 PM. Reason: grammer correction

              Comment

              • cnyh
                Member
                • Feb 2013
                • 39

                #8
                The reason for that is that cuffdiff additionally normalizes the counts values.

                Comment

                • pengchy
                  Senior Member
                  • Feb 2009
                  • 116

                  #9
                  Originally posted by cnyh View Post
                  The reason for that is that cuffdiff additionally normalizes the counts values.
                  If cuffdiff has normalized the counts value, it will be not need to normalize again when we using edgeR/DESeq to detect DEG with the output of cuffdiff counts?

                  Comment

                  • cnyh
                    Member
                    • Feb 2013
                    • 39

                    #10
                    That's what I understand, but I'm no expert in this.

                    Comment

                    • carmeyeii
                      Senior Member
                      • Mar 2011
                      • 137

                      #11
                      @Cynh @Pengchy

                      Cufflinks output should not be used for DeSeq/EdgeR. These use raw counts, which you can get after aligning with TopHat using HTSeqCount or similar programs.

                      Check this previous thread; there are a couple others referring to this issue.

                      Discussion of next-gen sequencing related bioinformatics: resources, algorithms, open source efforts, etc

                      Comment

                      Latest Articles

                      Collapse

                      • seqadmin
                        Pathogen Surveillance with Advanced Genomic Tools
                        by seqadmin




                        The COVID-19 pandemic highlighted the need for proactive pathogen surveillance systems. As ongoing threats like avian influenza and newly emerging infections continue to pose risks, researchers are working to improve how quickly and accurately pathogens can be identified and tracked. In a recent SEQanswers webinar, two experts discussed how next-generation sequencing (NGS) and machine learning are shaping efforts to monitor viral variation and trace the origins of infectious...
                        03-24-2025, 11:48 AM
                      • seqadmin
                        New Genomics Tools and Methods Shared at AGBT 2025
                        by seqadmin


                        This year’s Advances in Genome Biology and Technology (AGBT) General Meeting commemorated the 25th anniversary of the event at its original venue on Marco Island, Florida. While this year’s event didn’t include high-profile musical performances, the industry announcements and cutting-edge research still drew the attention of leading scientists.

                        The Headliner
                        The biggest announcement was Roche stepping back into the sequencing platform market. In the years since...
                        03-03-2025, 01:39 PM

                      ad_right_rmr

                      Collapse

                      News

                      Collapse

                      Topics Statistics Last Post
                      Started by seqadmin, Today, 10:17 AM
                      0 responses
                      6 views
                      0 reactions
                      Last Post seqadmin  
                      Started by seqadmin, 03-20-2025, 05:03 AM
                      0 responses
                      49 views
                      0 reactions
                      Last Post seqadmin  
                      Started by seqadmin, 03-19-2025, 07:27 AM
                      0 responses
                      59 views
                      0 reactions
                      Last Post seqadmin  
                      Started by seqadmin, 03-18-2025, 12:50 PM
                      0 responses
                      50 views
                      0 reactions
                      Last Post seqadmin  
                      Working...