Seqanswers Leaderboard Ad

Collapse

Announcement

Collapse
No announcement yet.
X
 
  • Filter
  • Time
  • Show
Clear All
new posts

  • cuffdif on different FPKM

    hi,all
    Recently, I used tophat and cufflinks to find out different gene expression within three RNA-seq library.

    I use cuffdif to analyze the different gene expression within sample1 and sample2, and then sample1 and sample3.

    when I compare the gene_ex.diff file of sample1 and sample2 with the gene_ex.diff file of sample1 and sample3, I find the the same gene's FPKM of sample1 in this two files is different. I feel confused about that.

    then I find that the every gene's sample1's FPKM in sample 1_2 divide sample1's FPKM in sample 1_3 is a constant

    Is the normalize of library cause the problem?
    I find the read_groups.info file in the output of cuffdif
    this is sampe 1_2
    na_tophat/accepted_hits.bam q1 0 1.61963e+07 1.30384e+07 1.84746 1
    ZM/accepted_hits.bam q2 0 7.76888e+06 1.30384e+07 0.57735 1

    this is sample 1_3

    na_tophat/accepted_hits.bam q1 0 1.61963e+07 1.35408e+07 2.52492 1
    PM/accepted_hits.bam q2 0 6.66703e+06 1.35408e+07 0.411644 1

    If so, why should do this normalization? why not use the raw FPKM ?

    Hope

  • #2
    which version of tophat and cufflinks ?

    Is the fpkm of a gene the same in the cufflinks output and in the cuffdiff output ?

    Comment


    • #3
      Originally posted by NicoBxl View Post
      which version of tophat and cufflinks ?

      Is the fpkm of a gene the same in the cufflinks output and in the cuffdiff output ?
      I used tophat v2.0.3 and cufflinks v2.0.0

      Because I just want to known the different expression in gene level, I didn't use cufflinks to transcript asembly, so I didn't get the cufflinks FPKM.

      I find that cuffdiff do normalize the data based on the library size, so cause gene1 FPKM in sample1 of sample 1_2 is different with gene1 FPKM in sample1 of sample 1_3.

      THis makes me have to do cuffdiff again, if I want to compare sample2 and sample3 roughly.

      Comment


      • #4
        I think using DESeq or edgeR (two R bioconductor package) will be more efficient for you if you have an annotation file.

        First align with tophat
        Then use htseq-count to extract the read count for each feature (in your case, for each gene)
        Then use DESeq or edgeR to compute DE analysis.
        DESeq and edgeR can normalize each sample, and you can after perform clustering on it.

        Comment


        • #5
          Originally posted by drunk_coder View Post
          hi,all
          Recently, I used tophat and cufflinks to find out different gene expression within three RNA-seq library.

          I use cuffdif to analyze the different gene expression within sample1 and sample2, and then sample1 and sample3.

          when I compare the gene_ex.diff file of sample1 and sample2 with the gene_ex.diff file of sample1 and sample3, I find the the same gene's FPKM of sample1 in this two files is different. I feel confused about that.

          then I find that the every gene's sample1's FPKM in sample 1_2 divide sample1's FPKM in sample 1_3 is a constant

          Is the normalize of library cause the problem?
          I find the read_groups.info file in the output of cuffdif
          this is sampe 1_2
          na_tophat/accepted_hits.bam q1 0 1.61963e+07 1.30384e+07 1.84746 1
          ZM/accepted_hits.bam q2 0 7.76888e+06 1.30384e+07 0.57735 1

          this is sample 1_3

          na_tophat/accepted_hits.bam q1 0 1.61963e+07 1.35408e+07 2.52492 1
          PM/accepted_hits.bam q2 0 6.66703e+06 1.35408e+07 0.411644 1

          If so, why should do this normalization? why not use the raw FPKM ?

          Hope
          Hi. Can you please post your Cuffdiff command?

          Comment


          • #6
            Originally posted by hpimentel View Post
            Hi. Can you please post your Cuffdiff command?
            Thank you for your reply

            sample1_2(NTA and PM two samples)
            command : cuffdiff -o NTA_PM -b Human_66.fa -p 8 -u Homo_sapiens.GRCh37.66.gtf ./nta_tophat/accepted_hits.bam ./PM/accepted_hits.bam

            sample1_3 (NTA and ZM two samples)
            command : cuffdiff -o NTA_ZM -b Human_66.fa -p 8 -u Homo_sapiens.GRCh37.66.gtf ./nta_tophat/accepted_hits.bam ./ZM/accepted_hits.bam

            Comment

            Latest Articles

            Collapse

            • seqadmin
              Recent Advances in Sequencing Analysis Tools
              by seqadmin


              The sequencing world is rapidly changing due to declining costs, enhanced accuracies, and the advent of newer, cutting-edge instruments. Equally important to these developments are improvements in sequencing analysis, a process that converts vast amounts of raw data into a comprehensible and meaningful form. This complex task requires expertise and the right analysis tools. In this article, we highlight the progress and innovation in sequencing analysis by reviewing several of the...
              05-06-2024, 07:48 AM
            • seqadmin
              Essential Discoveries and Tools in Epitranscriptomics
              by seqadmin




              The field of epigenetics has traditionally concentrated more on DNA and how changes like methylation and phosphorylation of histones impact gene expression and regulation. However, our increased understanding of RNA modifications and their importance in cellular processes has led to a rise in epitranscriptomics research. “Epitranscriptomics brings together the concepts of epigenetics and gene expression,” explained Adrien Leger, PhD, Principal Research Scientist...
              04-22-2024, 07:01 AM

            ad_right_rmr

            Collapse

            News

            Collapse

            Topics Statistics Last Post
            Started by seqadmin, 05-07-2024, 06:57 AM
            0 responses
            12 views
            0 likes
            Last Post seqadmin  
            Started by seqadmin, 05-06-2024, 07:17 AM
            0 responses
            16 views
            0 likes
            Last Post seqadmin  
            Started by seqadmin, 05-02-2024, 08:06 AM
            0 responses
            22 views
            0 likes
            Last Post seqadmin  
            Started by seqadmin, 04-30-2024, 12:17 PM
            0 responses
            24 views
            0 likes
            Last Post seqadmin  
            Working...
            X