I was wondering if folks had any advice on the simplest way to generate a principal component analysis or MDS plot with Cufflinks output. I could group all my Cufflinks output into a data matrix and then try to visualize it but I was wondering if there was a built in way in R (perhaps through a tool or package like cummerbun) Thanks -Rich
Seqanswers Leaderboard Ad
Collapse
Announcement
Collapse
No announcement yet.
X
-
Originally posted by greener View PostI was wondering if folks had any advice on the simplest way to generate a principal component analysis or MDS plot with Cufflinks output. I could group all my Cufflinks output into a data matrix and then try to visualize it but I was wondering if there was a built in way in R (perhaps through a tool or package like cummerbun) Thanks -Rich
Awesome idea, and one that can be very easily implemented with cummeRbund. If you run a cuffdiff on all of your samples, cummeRbund will take care of aggregating the information into a SQLite db behind the scenes. The generation of the FPKM matrix is very simple once this is complete:
Code:>library(cummeRbund) >cuff<-readCufflinks() >allGeneFPKMs<-fpkmMatrix(genes(cuff)) #You can of course use the transpose if you want samples instead of genes >genes.pca<-prcomp(allGeneFPKMs) >biplot(genes.pca) >allIsoformFPKMs<-fpkmMatrix(isoforms(cuff)) >isoforms.pca<-prcomp(allIsoformFPKMs)
Please let me know how this works out for you. It may be something that I would like to integrate into cummeRbund for the future.
Cheers,
Loyal
-
That's great, the plot generated by the MDSplot command is very useful. It would be nice to be able to color samples by arbitrary experimental conditions, for example, to be able to make all replicates of one sample the same color.
It is easy enough to run each command in the MDSplot function manually and supply a different "names" vector to the geom color option:
customnames<- c("Wildtype", "Wildtype", "Wildtype", "Mutant", "Mutant", "Mutant")
p <- p + geom_point(aes(x=M1,y=M2,color=customnames)) + geom_text(aes(x=M1,y=M2,label=names,color=customnames)) + theme_bw()
But it would be nice to have that capability built into the function.
Comment
Latest Articles
Collapse
-
by seqadmin
The field of epigenetics has traditionally concentrated more on DNA and how changes like methylation and phosphorylation of histones impact gene expression and regulation. However, our increased understanding of RNA modifications and their importance in cellular processes has led to a rise in epitranscriptomics research. “Epitranscriptomics brings together the concepts of epigenetics and gene expression,” explained Adrien Leger, PhD, Principal Research Scientist...-
Channel: Articles
04-22-2024, 07:01 AM -
-
by seqadmin
Proteins are often described as the workhorses of the cell, and identifying their sequences is key to understanding their role in biological processes and disease. Currently, the most common technique used to determine protein sequences is mass spectrometry. While still a valuable tool, mass spectrometry faces several limitations and requires a highly experienced scientist familiar with the equipment to operate it. Additionally, other proteomic methods, like affinity assays, are constrained...-
Channel: Articles
04-04-2024, 04:25 PM -
ad_right_rmr
Collapse
News
Collapse
Topics | Statistics | Last Post | ||
---|---|---|---|---|
Started by seqadmin, 05-02-2024, 08:06 AM
|
0 responses
16 views
0 likes
|
Last Post
by seqadmin
05-02-2024, 08:06 AM
|
||
Started by seqadmin, 04-30-2024, 12:17 PM
|
0 responses
20 views
0 likes
|
Last Post
by seqadmin
04-30-2024, 12:17 PM
|
||
Started by seqadmin, 04-29-2024, 10:49 AM
|
0 responses
25 views
0 likes
|
Last Post
by seqadmin
04-29-2024, 10:49 AM
|
||
Started by seqadmin, 04-25-2024, 11:49 AM
|
0 responses
28 views
0 likes
|
Last Post
by seqadmin
04-25-2024, 11:49 AM
|
Comment