Seqanswers Leaderboard Ad

Collapse

Announcement

Collapse
No announcement yet.
X
 
  • Filter
  • Time
  • Show
Clear All
new posts

  • Cufflinks and cuffdiff FPKM values

    I have questions about the FPKM values in each program.
    As I understand, the cufflinks or cuffdiff are counting the reads based on the gene structures, when the known annotations are provided. And FPKM value is a kind of absolute value for each transcript. In that case, can we expect the same FPKM value for the same sample, from each program, cuffdiff and cufflinks? or Is my understating not right?

    I tried two programs with same mapping results and same a known annotation file, but the FPKM values are different each other (from transcripts.expr and isoforms.fpkm_tracking files) Is cuffdiff differently estimating the expression of the transcript (FPKM) because of other sample?

    Thanks for any comments in advance.

  • #2
    Originally posted by combiochem View Post
    I have questions about the FPKM values in each program.
    As I understand, the cufflinks or cuffdiff are counting the reads based on the gene structures, when the known annotations are provided. And FPKM value is a kind of absolute value for each transcript. In that case, can we expect the same FPKM value for the same sample, from each program, cuffdiff and cufflinks? or Is my understating not right?

    I tried two programs with same mapping results and same a known annotation file, but the FPKM values are different each other (from transcripts.expr and isoforms.fpkm_tracking files) Is cuffdiff differently estimating the expression of the transcript (FPKM) because of other sample?

    Thanks for any comments in advance.
    I think cuffdiff uses a likelihood function to estimate FPKM which is not an absolute value (Read this thread: http://seqanswers.com/forums/showthread.php?t=3961)
    Which other program did you compare it with? If this other program is not using the same methodology, you will not get the same answer.

    Comment


    • #3
      Originally posted by thinkRNA View Post
      I think cuffdiff uses a likelihood function to estimate FPKM which is not an absolute value (Read this thread: http://seqanswers.com/forums/showthread.php?t=3961)
      Which other program did you compare it with? If this other program is not using the same methodology, you will not get the same answer.
      No, I'm not comparing the FPKM values from "other" program. I've compared the FPKM between cufflinks output and cuffdiff output. As far as I know, in cufflinks, when the known annotations are given, the FPKM for each transcript can be obtained (in one sample). Also in cuffdiff tracking file, the FPKM in each sample (q0 and q1) can be obtained. I've used exactly same annotations and mapping results (which have used for cufflinks) for cuffdiff running (therefore the read counts for each transcript should be same), but the FPKM outputs are different from cufflinks. I'm wondering whether the estimation is taking care of both samples in cuffdiff (which is different from cufflinks?) rather than each sample (q0 or q1).

      Comment


      • #4
        i actually went through the cufflinks-cuffcompare-cuffdiff pipeline and I also have the same problems. cufflinks and cuffdiff report different FPKM values for a same transcript.

        Comment


        • #5
          EDIT
          My question was answered in another thread.
          Last edited by Thomas Doktor; 11-23-2010, 05:02 AM. Reason: Question already answered

          Comment


          • #6
            Where I can find it? Please post the link, thanks!

            Comment


            • #7
              Please download the latest Cufflinks from the link below. I think now cufflinks and cuffdiff return consistent values.

              Comment


              • #8
                same problem with the latest Cufflinks version

                I am using cufflinks version 1.0.2 and have the same problem.
                Any suggestions?

                Comment


                • #9
                  This might explain it:
                  Cuffdiff and Cufflinks now accept new options controlling whether all hits are counted towards the FPKM denominator, or only those compatible with some transcript in the reference annotation. Counting only compatible hits avoids certain types of bias that arise when one sample contains far more hits that aren't compatible with any transcript than the other sample does. For example, if one sample contains vastly more mapped ribosomal RNA hits, FPKM values will appear lower in that sample, potentially leading to false positive differential expression calls. Cuffdiff by default now uses only compatible hits. Cufflinks still uses total hits by default, as using compatible hit accounting requires a reference GTF.

                  Comment


                  • #10
                    Have anyone also seen that some isoforms exist in the files generated by Cufflinks but disappear in the files generated by Cuffcompare and Cuffdiff?

                    Comment


                    • #11
                      cufflinks or cuffcompare bugs?

                      1. New released 1.3.0, after Cuffcompare, FPKM column contains all 0, missing FPKM values even tracking files have them;

                      2. in all the versions of CuffDiff, if you compare different conditions against the same control samples, the FPKM in the same control samples in different comparing is different; for example,
                      CuffDiff I: condition 1 v.s. condition control;
                      CuffDiff II: condition 2 v.s condition control;

                      after CuffDiff, when FPKM numbers are tracked, the FPKM of Gene X in condition control in CuffDiff I is different from the FPKM of Gene X in condition control in CuffDiff II. GeneX roughly are 20-30% in total annotated genes and the rest are the same.
                      anybody has explanation or suggestions for this? Thanks!

                      Comment


                      • #12
                        Originally posted by lewewoo View Post
                        after CuffDiff, when FPKM numbers are tracked, the FPKM of Gene X in condition control in CuffDiff I is different from the FPKM of Gene X in condition control in CuffDiff II. GeneX roughly are 20-30% in total annotated genes and the rest are the same.
                        anybody has explanation or suggestions for this? Thanks!
                        Did you use the "-N" option? I found that FPKM are different when I used "-N" option. And FPKM is consistent between different comparison when I closed "-N" option.

                        Comment


                        • #13
                          hi tangx 2010,
                          i have used cuffdiff on the galaxy platform...what exaclty is the N option?

                          how can i select/deselect this option?

                          thanks,ib

                          Comment

                          Latest Articles

                          Collapse

                          • seqadmin
                            Exploring the Dynamics of the Tumor Microenvironment
                            by seqadmin




                            The complexity of cancer is clearly demonstrated in the diverse ecosystem of the tumor microenvironment (TME). The TME is made up of numerous cell types and its development begins with the changes that happen during oncogenesis. “Genomic mutations, copy number changes, epigenetic alterations, and alternative gene expression occur to varying degrees within the affected tumor cells,” explained Andrea O’Hara, Ph.D., Strategic Technical Specialist at Azenta. “As...
                            07-08-2024, 03:19 PM
                          • seqadmin
                            Exploring Human Diversity Through Large-Scale Omics
                            by seqadmin


                            In 2003, researchers from the Human Genome Project (HGP) announced the most comprehensive genome to date1. Although the genome wasn’t fully completed until nearly 20 years later2, numerous large-scale projects, such as the International HapMap Project and 1000 Genomes Project, continued the HGP's work, capturing extensive variation and genomic diversity within humans. Recently, newer initiatives have significantly increased in scale and expanded beyond genomics, offering a more detailed...
                            06-25-2024, 06:43 AM

                          ad_right_rmr

                          Collapse

                          News

                          Collapse

                          Topics Statistics Last Post
                          Started by seqadmin, 07-16-2024, 05:49 AM
                          0 responses
                          26 views
                          0 likes
                          Last Post seqadmin  
                          Started by seqadmin, 07-15-2024, 06:53 AM
                          0 responses
                          32 views
                          0 likes
                          Last Post seqadmin  
                          Started by seqadmin, 07-10-2024, 07:30 AM
                          0 responses
                          40 views
                          0 likes
                          Last Post seqadmin  
                          Started by seqadmin, 07-03-2024, 09:45 AM
                          0 responses
                          205 views
                          0 likes
                          Last Post seqadmin  
                          Working...
                          X