hi,all
Recently, I used tophat and cufflinks to find out different gene expression within three RNA-seq library.
I use cuffdif to analyze the different gene expression within sample1 and sample2, and then sample1 and sample3.
when I compare the gene_ex.diff file of sample1 and sample2 with the gene_ex.diff file of sample1 and sample3, I find the the same gene's FPKM of sample1 in this two files is different. I feel confused about that.
then I find that the every gene's sample1's FPKM in sample 1_2 divide sample1's FPKM in sample 1_3 is a constant
Is the normalize of library cause the problem?
I find the read_groups.info file in the output of cuffdif
this is sampe 1_2
na_tophat/accepted_hits.bam q1 0 1.61963e+07 1.30384e+07 1.84746 1
ZM/accepted_hits.bam q2 0 7.76888e+06 1.30384e+07 0.57735 1
this is sample 1_3
na_tophat/accepted_hits.bam q1 0 1.61963e+07 1.35408e+07 2.52492 1
PM/accepted_hits.bam q2 0 6.66703e+06 1.35408e+07 0.411644 1
If so, why should do this normalization? why not use the raw FPKM ?
Hope
Recently, I used tophat and cufflinks to find out different gene expression within three RNA-seq library.
I use cuffdif to analyze the different gene expression within sample1 and sample2, and then sample1 and sample3.
when I compare the gene_ex.diff file of sample1 and sample2 with the gene_ex.diff file of sample1 and sample3, I find the the same gene's FPKM of sample1 in this two files is different. I feel confused about that.
then I find that the every gene's sample1's FPKM in sample 1_2 divide sample1's FPKM in sample 1_3 is a constant
Is the normalize of library cause the problem?
I find the read_groups.info file in the output of cuffdif
this is sampe 1_2
na_tophat/accepted_hits.bam q1 0 1.61963e+07 1.30384e+07 1.84746 1
ZM/accepted_hits.bam q2 0 7.76888e+06 1.30384e+07 0.57735 1
this is sample 1_3
na_tophat/accepted_hits.bam q1 0 1.61963e+07 1.35408e+07 2.52492 1
PM/accepted_hits.bam q2 0 6.66703e+06 1.35408e+07 0.411644 1
If so, why should do this normalization? why not use the raw FPKM ?
Hope
Comment