**chadn737** · 06-06-2012, 07:35 AM

Can you post an example?

**Marianna85** · 06-06-2012, 07:46 AM

Here you can find some of the significant results.

Attached Files

DEseqExample.txt (1.1 KB, 14 views)

**chadn737** · 06-06-2012, 07:51 AM

Nothing is wrong with your results; you are just reading them wrong. You need to shift the header row over by one column.

Code:

shiftOver->	id	baseMean	baseMeanA	baseMeanB	foldChange	log2FoldChange	pval	padj	
25916	Contig25916	93.49866325	93.49325776	93.50406873	1.000115634	0.000166814	1	1
18832	Contig18832	94.49867786	94.48786689	94.50948883	1.000228833	0.000330098	1	1
60472	larve17dpf_CGATGT_L002_R1_001_(paired)_trimmed_(paired)_contig_9475	188.9973557	188.9757338	189.0189777	1.000228833	0.000330098	1	1
125763	larve48dpf_ACAGTG_L002_R1_001_(paired)_trimmed_(paired)_contig_301	116.3746731	231.7439262	1.005420094	0.004338496	-7.84858929	0.000687227	0.971172987
218221	rud_dec2_c2353	301.3827814	599.7493025	3.016260282	0.005029202	-7.635454836	7.03E-05	0.24657811
56313	larve17dpf_CGATGT_L002_R1_001_(paired)_trimmed_(paired)_contig_642	393.3895309	782.7573815	4.021680376	0.005137838	-7.60462297	3.66E-05	0.177915006
130823	larve48dpf_ACAGTG_L002_R1_001_(paired)_trimmed_(paired)_contig_11355	82.55796287	164.1105056	1.005420094	0.006126482	-7.350725359	0.001972149	1
159688	onemoutholdseed_CTTGTA_L002_R1_001_(paired)_trimmed_(paired)_contig_5425	329.7345469	655.4474135	4.021680376	0.006135779	-7.3485378	7.80E-05	0.250145666

Notice that the very last column is your padj and the column immediately to its left is the p-value. The negative values are the log2 fold changes, not the p-values.

Easy solution: open your results in Excel and move the top row over one column. I think your results will be far more meaningful to you then.
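
For anyone who would rather not fix this by hand in Excel, here is a minimal Python sketch of the same repair. It assumes the exported file is tab-separated and that its header is exactly one field shorter than the data rows, because the first data column is an unlabeled row number. The function name `read_shifted_table` and the label `row` are made up for illustration, and the sample text just reuses two rows from the table above.

```python
import csv
import io


def read_shifted_table(text, index_name="row"):
    """Parse tab-separated text whose header may be one field shorter than its rows."""
    rows = [r for r in csv.reader(io.StringIO(text), delimiter="\t") if r]
    header, data = rows[0], rows[1:]
    # A header that is exactly one field short means the first data column is an
    # unlabeled row name. Label it so the other names line up with their values.
    if data and len(header) == len(data[0]) - 1:
        header = [index_name] + header
    return [dict(zip(header, row)) for row in data]


# Two rows copied from the thread; the header layout is an assumption.
sample = (
    "id\tbaseMean\tbaseMeanA\tbaseMeanB\tfoldChange\tlog2FoldChange\tpval\tpadj\n"
    "25916\tContig25916\t93.49866325\t93.49325776\t93.50406873\t"
    "1.000115634\t0.000166814\t1\t1\n"
    "125763\tlarve48dpf_ACAGTG_L002_R1_001_(paired)_trimmed_(paired)_contig_301\t"
    "116.3746731\t231.7439262\t1.005420094\t0.004338496\t-7.84858929\t"
    "0.000687227\t0.971172987\n"
)

records = read_shifted_table(sample)
for rec in records:
    print(rec["id"], rec["log2FoldChange"], rec["pval"], rec["padj"])
```

With the header realigned, the negative numbers land under `log2FoldChange` and `pval` stays between 0 and 1.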

**Marianna85** · 06-06-2012, 08:45 AM

AHHHHHHHHHHH

probably there was a problem in the conversion from txt to excel...
now it's clear...unfortunately I only have 10 genes with a padj <0.01.

Thank you very much!

**mbblack** · 06-06-2012, 09:09 AM

With no replicates, you have no statistical discrimination. All you are doing is effectively comparing the difference between single pairs of numbers, so even those 10 with a "significant" adjusted P-value are highly suspect.

Differential gene expression without replication simply cannot be done. Would you do population genetics on allele frequencies from samples with N of 1? Of course not. You may be able to say which of your genes had different count values between your samples, but you cannot assign any significance (statistical or biological) to those differences.

Back in the early days of microarrays, it seems we went through the same thing. Everyone trying to squeeze meaning out of array experiments with single samples in each comparison group. It did not work then, and it does not work now.

**Marianna85** · 06-06-2012, 11:44 PM

mbblack, I agree with you. I know that without replicates no reliable differences can be detected.
The two libraries are pools of several larvae, and the aim of the study was mainly to obtain information about the transcriptome. I read that it could be possible to do a differential expression analysis even without replicates, and I decided to try. But the only thing that I can reliably say is that there are a few genes with a high fold change between the two samples, and further evaluations are needed.

Thank you

**mbblack** · 06-07-2012, 04:05 AM

I understand, but even fold change in that instance is highly suspect. Fold change is just relative difference. In your case, you have single pairs of numbers, with no idea at all of how much variation there would normally be around those numbers, so your estimates of fold change tell you nothing about what the actual differences might or might not be between your treatments.

Again, this is the same debate that went on 10-12 years ago with arrays, when people argued for using fold change as the basis for biological interpretation of non-replicate experiments. But fold change as an estimate of difference depends on properly characterizing variation within each population just as much as statistical tests of difference do. Without replicates to measure variation within each group, you have no idea of what the true degree of difference between them is. Your large fold changes may merely be due to pure chance in your selection of specimens for sequencing. Had you done replicates, those "large" fold changes, when based on robust mean difference, may turn out to be trivial (especially if those genes are as varied within any treatment or strain as they are between them).
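
The point about chance can be illustrated with a small simulation. This is only a sketch under made-up assumptions, not a model of this dataset: every gene has the same true mean in both groups (so there is no real difference anywhere), within-group variation is drawn from a gamma distribution with shape 2, and "large" means at least a two-fold change. The function name and all parameter values are hypothetical.

```python
import math
import random


def count_large_fold_changes(n_genes, n_reps, shape=2.0, mean=100.0, seed=1):
    """Count genes with |log2 fold change| >= 1 when both groups share one true mean."""
    rng = random.Random(seed)
    scale = mean / shape  # gamma mean = shape * scale
    large = 0
    for _ in range(n_genes):
        # Group means over n_reps samples drawn from the SAME distribution.
        a = sum(rng.gammavariate(shape, scale) for _ in range(n_reps)) / n_reps
        b = sum(rng.gammavariate(shape, scale) for _ in range(n_reps)) / n_reps
        if abs(math.log2(b / a)) >= 1.0:
            large += 1
    return large


single = count_large_fold_changes(2000, n_reps=1)
replicated = count_large_fold_changes(2000, n_reps=5)
print("genes with >= 2-fold change, 1 sample per group: ", single)
print("genes with >= 2-fold change, 5 samples per group:", replicated)
```

Under these assumptions, far more genes cross the two-fold line with one sample per group than with five, even though none of them truly differs. And without replicates there is no way to estimate the within-group spread that the simulation had to be given as an input.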

negative p-value in DEseq analysis
