Hello all,
I am running DESeq2 like so in R:
And I am getting a PCA plot that looks like so where 138 genes are padj <0.05 between the blue and red conditions.

I would expect for the blue replicates to be clustered and the red as well. Given that there were a fair amount of significant genes, I think that I a plotting this PCA wrong.
When I check the columns to make sure I am using the right I get this:
Is this something to be concerned about or is this the wrong way to plot PCA?
Thanks in advance
-R
I am running DESeq2 like so in R:
Code:
library(DESeq2) sTable = data.frame(sampleName = files, fileName = files, condition = cond) dds <- DESeqDataSetFromHTSeqCount(sampleTable = sTable, directory = "", design = ~condition) dds <- DESeq(dds) res <- results(dds) resOrdered <- res[order(res$padj),] rld <- rlogTransformation(dds, blind=TRUE) print(plotPCA(rld, intgroup="condition"))

I would expect for the blue replicates to be clustered and the red as well. Given that there were a fair amount of significant genes, I think that I a plotting this PCA wrong.
When I check the columns to make sure I am using the right I get this:
Code:
> colData(dds) DataFrame with 6 rows and 2 columns condition <factor> ID_18_1.bam_sorted.bam_htseq_out.txt ID18 ID_18_2.bam_sorted.bam_htseq_out.txt ID18 ID_18_3.bam_sorted.bam_htseq_out.txt ID18 GP_18_1.bam_sorted.bam_htseq_out.txt GP18 GP_18_2.bam_sorted.bam_htseq_out.txt GP18 GP_18_3.bam_sorted.bam_htseq_out.txt GP18
Thanks in advance
-R
Comment