Announcement

Collapse
No announcement yet.
X
 
  • Filter
  • Time
  • Show
Clear All
new posts

  • Different DESeq2 results (both FC and pvalue) by changing col/row orders in input

    I'm getting different p-values and FCs from DESeq2 (v1.34.0) by simply changing the order of the columns in the count table. I can't understand why. I made sure the cols of count table match the rows of colData in both.

    The differences in this dataset are small ([-0.003, 0.003] for p-value, and [-0.002, 0.001] for FC). But in some other datasets, the difference of log2FC can be up to 0.8 (and the baseMean for that gene is 300), which is concerning.

    I tried setting `betaPrior = FALSE` but still got the difference.

    Column `Genotype` in colData is the only tested variable in the model. It contains `WT` and `Mutant` as factors, with `WT` as the 1st level.

    Actually if I change column `Genotype` to character type and rerun DESeq2 (so DESeq2 will convert them to factors with the warning message), I get the same results independent of the col/row orders in the input. So it seems the difference is somehow related to the factorization of the columns in colData, but I don't know why, and which results I should trust.



    Code:
    data <- readRDS('data.rds')
    
    dds1 <- DESeqDataSetFromMatrix(countData = data[['counts']], colData = data[['colData']], design = ~ Genotype)
    dds1 <- dds1[rowSums(counts(dds1)) > 1, ]
    dds1 <- DESeq(dds1, betaPrior = TRUE)
    res1 <- as.data.frame(results(dds1, contrast = list("GenotypeMutant","GenotypeWT")))
    
    # only reorder the inputs, but the columns of counts still match the rows of colData:
    dds2 <- DESeqDataSetFromMatrix(countData = data[['counts']][,c(4:6,1:3)], colData = data[['colData']][c(4:6,1:3),], design = ~ Genotype)
    dds2 <- dds2[rowSums(counts(dds2)) > 1, ]
    dds2 <- DESeq(dds2, betaPrior = TRUE)
    res2 <- as.data.frame(results(dds2, contrast = list("GenotypeMutant","GenotypeWT")))
    
    res1['ENSMUSG00000069515',]
    res2['ENSMUSG00000069515',]


    Code:
    baseMean log2FoldChange lfcSE stat pvalue
    ENSMUSG00000069515 3043.921 1.142537 0.09271227 12.32347 6.771949e-35
    padj
    ENSMUSG00000069515 4.605602e-31
    
    
    baseMean log2FoldChange lfcSE stat pvalue
    ENSMUSG00000069515 3043.921 1.142512 0.09272373 12.32168 6.923947e-35
    padj
    ENSMUSG00000069515 4.708976e-31

Latest Articles

Collapse

  • seqadmin
    Advanced Tools Transforming the Field of Cytogenomics
    by seqadmin


    At the intersection of cytogenetics and genomics lies the exciting field of cytogenomics. It focuses on studying chromosomes at a molecular scale, involving techniques that analyze either the whole genome or particular DNA sequences to examine variations in structure and behavior at the chromosomal or subchromosomal level. By integrating cytogenetic techniques with genomic analysis, researchers can effectively investigate chromosomal abnormalities related to diseases, particularly...
    09-26-2023, 06:26 AM
  • seqadmin
    How RNA-Seq is Transforming Cancer Studies
    by seqadmin



    Cancer research has been transformed through numerous molecular techniques, with RNA sequencing (RNA-seq) playing a crucial role in understanding the complexity of the disease. Maša Ivin, Ph.D., Scientific Writer at Lexogen, and Yvonne Goepel Ph.D., Product Manager at Lexogen, remarked that “The high-throughput nature of RNA-seq allows for rapid profiling and deep exploration of the transcriptome.” They emphasized its indispensable role in cancer research, aiding in biomarker...
    09-07-2023, 11:15 PM

ad_right_rmr

Collapse

News

Collapse

Topics Statistics Last Post
Started by seqadmin, Today, 07:14 AM
0 responses
5 views
0 likes
Last Post seqadmin  
Started by seqadmin, 09-29-2023, 09:38 AM
0 responses
12 views
0 likes
Last Post seqadmin  
Started by seqadmin, 09-27-2023, 06:57 AM
0 responses
13 views
0 likes
Last Post seqadmin  
Started by seqadmin, 09-26-2023, 07:53 AM
0 responses
30 views
0 likes
Last Post seqadmin  
Working...
X