Seqanswers Leaderboard Ad

**jordiet** · 11-15-2012, 02:30 AM

Hi devking,

sorry I can't help you with your CuffDiff issue. Have you used BY4741 as a reference genome for your alignments? I am working with the same strain and I would like to use it as a reference genome to align my evolved strains to this one. Do you know where can I find an assembled genome of BY4741 to use as a reference?

**tir_al** · 11-15-2012, 03:20 AM

Dear Devkin,

The heatmap you posted was done in R using the ggplot2 package.

As you are already using the tuxedo pipeline, you could take a look at the cummeRbund

which should enable you to get easy data integration.

Best

**NicoBxl** · 11-15-2012, 05:36 AM

Did you try DESeq ? There is an interesting clustering section in the DESeq vignette on bioconductor.

It's pretty simple to use :

1. align
2. Extract read count with htseq-count
3. Use DESeq to perform DE analysis and plot some results

Check : http://bioconductor.org/packages/rel...tml/DESeq.html

**devking** · 11-15-2012, 11:22 AM

Originally posted by jordiet View Post

Hi devking,

sorry I can't help you with your CuffDiff issue. Have you used BY4741 as a reference genome for your alignments? I am working with the same strain and I would like to use it as a reference genome to align my evolved strains to this one. Do you know where can I find an assembled genome of BY4741 to use as a reference?

Hi Jordiet

For my analysis I was only interested in quantifying expression according to a known and annotated genome so I used the Ensembl release 69 EF4 genome.

I figured since the transcripts in yeast are so well-annotated already it seemed like more trouble than it was worth to assemble a new transcriptome using cufflinks/cuffmerge.

I might be misunderstanding your question, but it seems like you're more interested in doing a full analysis as outlined in the nat protocol paper "Differential gene and transcript expression analysis of RNA-seq experiments with TopHat and Cufflinks" in which you'll have your own assembled transcriptome annotation to quantify.

check the paper: http://www.nature.com/nprot/journal/....2012.016.html

Best,
Devin

**devking** · 11-15-2012, 11:36 AM

Originally posted by tir_al View Post

As you are already using the tuxedo pipeline, you could take a look at the cummeRbund

Hi Tir,

Thanks for your response! I have been using cummeRbund and it has been very easy to use so far to make heatmaps of gene expression. But I didn't immediately see a way to look at *fold-change* expression between two conditions. (I've never used something like R before so things that should be simple usually aren't for me

). However, on the advice of one of the cummeRbund authors, Loyal, I was able to make log fold-change heat maps by extracting raw FPKM values from fpkmMatrix(), do the log2FC transformation in R, then use heatmap.2() from gplots to create something like the attached file.

Attached Files

165OverlappingGenesCLUSTER.png (39.4 KB, 249 views)

**devking** · 11-15-2012, 11:43 AM

Originally posted by NicoBxl View Post

Did you try DESeq ? There is an interesting clustering section in the DESeq vignette on bioconductor.

Hi Nico,

Thanks for the tip. It seems like all of the differential gene expression statistical methods (e.g. cuffdiff, DESeq, edgeR, NOIseq etc) return slightly different sets of significantly differentially expressed genes. E.g. check out:

http://nar.oxfordjournals.org/content/early/2012/09/08/nar.gks804.full

So I was actually planning on using DESeq next and comparing the results with my cuffdiff data. I was thinking of just focusing on the genes that tested significant for DE from both methods. Not sure if this is reasonable or worth the time...

Best,
Devin

**emolinari** · 06-17-2013, 10:39 AM

Originally posted by devking View Post

Hi Tir,

Thanks for your response! I have been using cummeRbund and it has been very easy to use so far to make heatmaps of gene expression. But I didn't immediately see a way to look at *fold-change* expression between two conditions. (I've never used something like R before so things that should be simple usually aren't for me

). However, on the advice of one of the cummeRbund authors, Loyal, I was able to make log fold-change heat maps by extracting raw FPKM values from fpkmMatrix(), do the log2FC transformation in R, then use heatmap.2() from gplots to create something like the attached file.

Hi Devking,

I was looking to do the same plot as you did...is it done with DESeq?
did you use a customized R script?
Thanks!!!
Manu

**devking** · 06-17-2013, 12:13 PM

I quantified expression using the 'Tuxedo' protocol (http://www.nature.com/nprot/journal/....2012.016.html) and used a custom R script to generate the heatmap. Using cummeRbund, you can get a matrix of FPKM expression values, and then use heatmap.2() in the R 'gplots' package for the heatmap call. You could also import your expression matrix into Java TreeView which also plots nice heatmaps. Hope this helps!

**emolinari** · 06-17-2013, 12:20 PM

Originally posted by devking View Post

I quantified expression using the 'Tuxedo' protocol (http://www.nature.com/nprot/journal/....2012.016.html) and used a custom R script to generate the heatmap. Using cummeRbund, you can get a matrix of FPKM expression values, and then use heatmap.2() in the R 'gplots' package for the heatmap call. You could also import your expression matrix into Java TreeView which also plots nice heatmaps. Hope this helps!

Thanks for the answer...I actually have followed the same path, but I have just been able to produce a differential expression heatmap.

I'll see what I can do for up and down regulation...unfortunately java is too hardcore informatics for me!!!
Thanks
Manu

**devking** · 06-17-2013, 12:39 PM

Maybe this can help you get started:

Code:

library(cummeRbund); library(gplots)
cuff <- readCufflinks("cuffdiff_out",rebuild=T)
db <- fpkmMatrix(genes(cuff))
sigGenes <- getSig(cuff,'BY4741','ino80',alpha=0.05,level='genes')
db <- db[sigGenes,]	


WT <- "BY4741" # Set the column name of the WT sample
mut <- c("ino80","arp5","ies6") # Set column names of test conditions you want in the analysis

logFC <- function(db,mutants,WT,logBase=2,pseudo=1) {
		if (length(WT) !=1 ) {
			stop('WT must refer to a single gene/column')
			}
		if (is.numeric(logBase)==FALSE) {
			stop('logBase must be a numeric value')
			}
		
		db <- (db[,mutants]+pseudo)/(db[,WT]+pseudo)
		db <- log(db,logBase)
		
}

db <- logFC(db,mutants=mut,WT=WT,logBase=2,pseudo=1) # This does the log transformation
db <- as.matrix(db)

heatmap.2(
		db,
		Rowv=TRUE,
		Colv=FALSE,
		dendrogram="row",
		trace="none",
		labRow="",
		density.info=c("none"),
		main="Log(Fold-Change) \nExpression Profiles"
	)

**emolinari** · 06-18-2013, 08:11 AM

Thanks Devking!
This really helps me out!!!

**jp.** · 07-22-2013, 03:40 AM

Hi
Can you please give me your commands to make csHeatmap using cummeRbund ?
I tried making the using cummeRbund and succeeded, however, can not add dandogram and change color which you have done it.
May you please write back to me in little detail because I am a beginer.
Thank you in advance.
Jp.

Originally posted by devking View Post

Hi Tir,

Thanks for your response! I have been using cummeRbund and it has been very easy to use so far to make heatmaps of gene expression. But I didn't immediately see a way to look at *fold-change* expression between two conditions. (I've never used something like R before so things that should be simple usually aren't for me

). However, on the advice of one of the cummeRbund authors, Loyal, I was able to make log fold-change heat maps by extracting raw FPKM values from fpkmMatrix(), do the log2FC transformation in R, then use heatmap.2() from gplots to create something like the attached file.

**jp.** · 10-30-2013, 10:49 PM

Hi
Would you post your commands to make dendrogram in csHeatmap ?
I just can not add dendrogram using csHeatmap.
You used heatmap.2(.... can read cuffdiff out and make heatmap with dendrogram ?
Thank you

Originally posted by devking View Post

Hi Tir,

Thanks for your response! I have been using cummeRbund and it has been very easy to use so far to make heatmaps of gene expression. But I didn't immediately see a way to look at *fold-change* expression between two conditions. (I've never used something like R before so things that should be simple usually aren't for me

). However, on the advice of one of the cummeRbund authors, Loyal, I was able to make log fold-change heat maps by extracting raw FPKM values from fpkmMatrix(), do the log2FC transformation in R, then use heatmap.2() from gplots to create something like the attached file.

**hubery_Bio** · 11-07-2013, 06:22 AM

maybe the program cluster can help u. you just need to change the format of the cuffdiff result.

Topics	Statistics	Last Post
Study Highlights Challenges in Cellular Reprogramming for Regenerative Medicine by seqadmin Started by seqadmin, Today, 06:25 AM	0 responses 13 views 0 likes	Last Post by seqadmin Today, 06:25 AM
New DNA Modification Discovered as Key to Gene Activation in Early Development by seqadmin Started by seqadmin, Yesterday, 01:02 PM	0 responses 12 views 0 likes	Last Post by seqadmin Yesterday, 01:02 PM
Wastewater Analysis Unlocks New Method for Identifying Public Health Threats by seqadmin Started by seqadmin, 09-18-2024, 06:39 AM	0 responses 14 views 0 likes	Last Post by seqadmin 09-18-2024, 06:39 AM
Molecular Markers Shared Across Dementias by seqadmin Started by seqadmin, 09-11-2024, 02:44 PM	0 responses 14 views 0 likes	Last Post by seqadmin 09-11-2024, 02:44 PM

Seqanswers Leaderboard Ad

Announcement

A fold change heatmap for RNA seq analysis using CuffDiff and cummerbund

Comment

Comment

Comment

Comment

Comment

Comment

Comment

Comment

Comment

Comment

Comment

Comment

Comment

Comment

Latest Articles

ad_right_rmr

News