Unconfigured Ad

Collapse
X
 
  • Filter
  • Time
  • Show
Clear All
new posts
  • combiochem
    Member
    • Jul 2009
    • 11

    Cufflinks and cuffdiff FPKM values

    I have questions about the FPKM values in each program.
    As I understand, the cufflinks or cuffdiff are counting the reads based on the gene structures, when the known annotations are provided. And FPKM value is a kind of absolute value for each transcript. In that case, can we expect the same FPKM value for the same sample, from each program, cuffdiff and cufflinks? or Is my understating not right?

    I tried two programs with same mapping results and same a known annotation file, but the FPKM values are different each other (from transcripts.expr and isoforms.fpkm_tracking files) Is cuffdiff differently estimating the expression of the transcript (FPKM) because of other sample?

    Thanks for any comments in advance.
  • thinkRNA
    Member
    • Jan 2010
    • 94

    #2
    Originally posted by combiochem View Post
    I have questions about the FPKM values in each program.
    As I understand, the cufflinks or cuffdiff are counting the reads based on the gene structures, when the known annotations are provided. And FPKM value is a kind of absolute value for each transcript. In that case, can we expect the same FPKM value for the same sample, from each program, cuffdiff and cufflinks? or Is my understating not right?

    I tried two programs with same mapping results and same a known annotation file, but the FPKM values are different each other (from transcripts.expr and isoforms.fpkm_tracking files) Is cuffdiff differently estimating the expression of the transcript (FPKM) because of other sample?

    Thanks for any comments in advance.
    I think cuffdiff uses a likelihood function to estimate FPKM which is not an absolute value (Read this thread: http://seqanswers.com/forums/showthread.php?t=3961)
    Which other program did you compare it with? If this other program is not using the same methodology, you will not get the same answer.

    Comment

    • combiochem
      Member
      • Jul 2009
      • 11

      #3
      Originally posted by thinkRNA View Post
      I think cuffdiff uses a likelihood function to estimate FPKM which is not an absolute value (Read this thread: http://seqanswers.com/forums/showthread.php?t=3961)
      Which other program did you compare it with? If this other program is not using the same methodology, you will not get the same answer.
      No, I'm not comparing the FPKM values from "other" program. I've compared the FPKM between cufflinks output and cuffdiff output. As far as I know, in cufflinks, when the known annotations are given, the FPKM for each transcript can be obtained (in one sample). Also in cuffdiff tracking file, the FPKM in each sample (q0 and q1) can be obtained. I've used exactly same annotations and mapping results (which have used for cufflinks) for cuffdiff running (therefore the read counts for each transcript should be same), but the FPKM outputs are different from cufflinks. I'm wondering whether the estimation is taking care of both samples in cuffdiff (which is different from cufflinks?) rather than each sample (q0 or q1).

      Comment

      • mrfox
        Senior Member
        • Aug 2010
        • 103

        #4
        i actually went through the cufflinks-cuffcompare-cuffdiff pipeline and I also have the same problems. cufflinks and cuffdiff report different FPKM values for a same transcript.

        Comment

        • Thomas Doktor
          Senior Member
          • Apr 2009
          • 105

          #5
          EDIT
          My question was answered in another thread.
          Last edited by Thomas Doktor; 11-23-2010, 05:02 AM. Reason: Question already answered

          Comment

          • lewewoo
            Member
            • Apr 2011
            • 60

            #6
            Where I can find it? Please post the link, thanks!

            Comment

            • mrfox
              Senior Member
              • Aug 2010
              • 103

              #7
              Please download the latest Cufflinks from the link below. I think now cufflinks and cuffdiff return consistent values.

              Comment

              • reut
                Member
                • Oct 2010
                • 19

                #8
                same problem with the latest Cufflinks version

                I am using cufflinks version 1.0.2 and have the same problem.
                Any suggestions?

                Comment

                • Thomas Doktor
                  Senior Member
                  • Apr 2009
                  • 105

                  #9
                  This might explain it:
                  Cuffdiff and Cufflinks now accept new options controlling whether all hits are counted towards the FPKM denominator, or only those compatible with some transcript in the reference annotation. Counting only compatible hits avoids certain types of bias that arise when one sample contains far more hits that aren't compatible with any transcript than the other sample does. For example, if one sample contains vastly more mapped ribosomal RNA hits, FPKM values will appear lower in that sample, potentially leading to false positive differential expression calls. Cuffdiff by default now uses only compatible hits. Cufflinks still uses total hits by default, as using compatible hit accounting requires a reference GTF.

                  Comment

                  • arrchi
                    Member
                    • Mar 2011
                    • 46

                    #10
                    Have anyone also seen that some isoforms exist in the files generated by Cufflinks but disappear in the files generated by Cuffcompare and Cuffdiff?

                    Comment

                    • lewewoo
                      Member
                      • Apr 2011
                      • 60

                      #11
                      cufflinks or cuffcompare bugs?

                      1. New released 1.3.0, after Cuffcompare, FPKM column contains all 0, missing FPKM values even tracking files have them;

                      2. in all the versions of CuffDiff, if you compare different conditions against the same control samples, the FPKM in the same control samples in different comparing is different; for example,
                      CuffDiff I: condition 1 v.s. condition control;
                      CuffDiff II: condition 2 v.s condition control;

                      after CuffDiff, when FPKM numbers are tracked, the FPKM of Gene X in condition control in CuffDiff I is different from the FPKM of Gene X in condition control in CuffDiff II. GeneX roughly are 20-30% in total annotated genes and the rest are the same.
                      anybody has explanation or suggestions for this? Thanks!

                      Comment

                      • tangx_2010
                        Junior Member
                        • Mar 2011
                        • 5

                        #12
                        Originally posted by lewewoo View Post
                        after CuffDiff, when FPKM numbers are tracked, the FPKM of Gene X in condition control in CuffDiff I is different from the FPKM of Gene X in condition control in CuffDiff II. GeneX roughly are 20-30% in total annotated genes and the rest are the same.
                        anybody has explanation or suggestions for this? Thanks!
                        Did you use the "-N" option? I found that FPKM are different when I used "-N" option. And FPKM is consistent between different comparison when I closed "-N" option.

                        Comment

                        • IBseq
                          Member
                          • Jul 2012
                          • 56

                          #13
                          hi tangx 2010,
                          i have used cuffdiff on the galaxy platform...what exaclty is the N option?

                          how can i select/deselect this option?

                          thanks,ib

                          Comment

                          Latest Articles

                          Collapse

                          • SEQadmin2
                            Nine Things a Sample Prep Scientist Thinks About Before Sequencing
                            by SEQadmin2


                            I’m not a sequencing expert. I’m a purification scientist who uses NGS to evaluate workflows my group develops. With this perspective, we think about the sample first and the NGS workflow second. The sequencer is an exceptionally honest reporter, but it can only report on what you give it, so whether you get clean, interpretable data from an NGS workflow is largely determined before you begin.


                            Here are nine questions we think about, in roughly the order they matter, before...
                            06-18-2026, 07:11 AM
                          • SEQadmin2
                            From Collection to Sequencing: Why Sample Preparation and Preservation Define Sequencing Data
                            by SEQadmin2


                            Data variability is still an issue in sequencing technologies despite the advances in reproducibility and accuracy of these platforms. But the problem does not originate in the sequencing itself, but in the previous steps, before the sample reaches the sequencer.


                            The first step is collection, followed by preservation and sample preparation for analysis. Most scientists overlook those steps, but not being careful might just be skewing the experiment’s results.
                            ...
                            06-02-2026, 10:05 AM
                          • SEQadmin2
                            Single-Cell Sequencing at an Inflection Point: Early Impacts of New Platforms and Emerging Trends
                            by SEQadmin2


                            With the launch of new single-cell sequencing platforms in 2026, the field stands at an exciting inflection point. This article surveys the most impactful advances in the field and discusses how they’re reshaping research in cancer, immunology, and beyond.


                            Introduction

                            Single-cell sequencing technologies have undergone remarkable advances over the past decade, transitioning from low-throughput experimental approaches to highly scalable platforms capable of...
                            05-22-2026, 06:42 AM

                          ad_right_rmr

                          Collapse

                          News

                          Collapse

                          Topics Statistics Last Post
                          Started by SEQadmin2, 06-17-2026, 06:09 AM
                          0 responses
                          21 views
                          0 reactions
                          Last Post SEQadmin2  
                          Started by SEQadmin2, 06-09-2026, 11:58 AM
                          0 responses
                          38 views
                          0 reactions
                          Last Post SEQadmin2  
                          Started by SEQadmin2, 06-05-2026, 10:09 AM
                          0 responses
                          45 views
                          0 reactions
                          Last Post SEQadmin2  
                          Started by SEQadmin2, 06-04-2026, 08:59 AM
                          0 responses
                          49 views
                          0 reactions
                          Last Post SEQadmin2  
                          Working...