Seqanswers Leaderboard Ad
Collapse
Announcement
Collapse
No announcement yet.
X
-
Yes, batch effects can be quite large on occasion (or quite small, you never know). Have a look at the SVA package on bioconductor.
-
Thanks for all!I think I have understand the important of biological replicates ,and I will use some magnitude threshold,or some biological methods to find the genes which I interested in.
On the other hand,I want to get the raw counts about Prostate cancer from TCGA,then use DEseq and edgeR to find differential expression genes,I don't know whether I can get the raw data from TCGA.
I get a new question ,my 180 sample from TCGA is from different batch ,is there a impact to our analysis ?
Leave a comment:
-
The more samples you use, the more power you have. That's a completely expected result. With enough samples you'd likely find almost everything to be at least slightly different between the two samples.
"PCA didn't work well" is meaningless. Perhaps the samples clustered by group or perhaps not, but in neither case can one say that the PCA itself didn't work well.
Leave a comment:
-
Any time that you increase the number of replicates, you will likely detect more significant results. That is the whole point of replication. The more replicates, the more precise is your estimate of the population mean and variance, and hence the smaller the change you can now detect as significant (i.e. unlikely to be observed by chance). Statistical significance is all about your ability to estimate population mean and variance, and the more replicates you have, the better your estimates of those parameters.
This is the very reason why biological replicates are so important in detecting differential gene expression. The more replicates, the greater your ability to detect ever more subtle changes in gene expression.
Unless you have some rational reason to reject a sample, you should always use all of your biological replicates when testing for differential gene expression - you have the most power to discriminate differences then.
When selecting differentially expressed genes though, you should NOT rely purely on statistical significance. Many published studies have clearly shown that you will get your most reliable results if you simultaneously use both a statistical threshold (e.g. FDR<0.05 is common), AND a magnitude threshold (e.g. absolute value of estimated fold change >1.5, or >2.0 are common cutoffs).
Genes selected by simultaneously applying a statistical, AND a magnitude threshold are the most likely to validate via an independent method such as RT-qPCR.
Leave a comment:
-
Thank you for your reply!
The data which I got is 180 samples ,contain 141 tumor samples and 39 normal samples .I have analysis the data with LIMMA and T-Test,when I use LIMMA I choose 3 tumor and 3 normal to analysis ,I find about 80 different expressed genes(diffgenes),then I choos 10 tumor and 10 normal,I get about 2000 diffgenes ,finally I use all the tumor and normal samples ,unintelligibly,I get 10,000 diffgenes.
I don't know why this phenomenon happen ?
Should I do some cluster or PCA before analysis (Actually,I have done these method ,but cluster and PCA didn't work well,I can't cluster all the 180 sample perfectly),is there any other method to do this work?
I'm a junior ,thank you for your help !
Leave a comment:
-
Starting with normalized data, you can use LIMMA, T-Tests or ANOVA. Of course, without biological replicates, you cannot do any statistical analysis of differential gene expression at all, so I'm assuming you have at least 2 biological replicates for each condition (and really, 3 should be the bare minimum acceptable for any reliable stats).
Leave a comment:
-
How to deal with normalized RNA seq data?
Hi everyone !
I have got the TCGA data about prostatic cancer,the data has been normalized.Now I want to analysis it and find some different expressed genes.
my question is what method can I use except limma and t.test?
Thank you !
Latest Articles
Collapse
-
by seqadmin
Like all molecular biology applications, next-generation sequencing (NGS) workflows require diligent quality control (QC) measures to ensure accurate and reproducible results. Proper QC begins at nucleic acid extraction and continues all the way through to data analysis. This article outlines the key QC steps in an NGS workflow, along with the commonly used tools and techniques.
Nucleic Acid Quality Control
Preparing for NGS starts with isolating the...-
Channel: Articles
02-10-2025, 01:58 PM -
-
by seqadmin
In recent years, precision medicine has become a major focus for researchers and healthcare professionals. This approach offers personalized treatment and wellness plans by utilizing insights from each person's unique biology and lifestyle to deliver more effective care. Its advancement relies on innovative technologies that enable a deeper understanding of individual variability. In a joint documentary with our colleagues at Biocompare, we examined the foundational principles of precision...-
Channel: Articles
01-27-2025, 07:46 AM -
ad_right_rmr
Collapse
News
Collapse
Topics | Statistics | Last Post | ||
---|---|---|---|---|
Genetic Mapping of Plasmodium knowlesi Identifies Essential Genes and Drug Resistance Mechanisms
by seqadmin
Started by seqadmin, 02-07-2025, 09:30 AM
|
0 responses
72 views
0 likes
|
Last Post
by seqadmin
02-07-2025, 09:30 AM
|
||
Started by seqadmin, 02-05-2025, 10:34 AM
|
0 responses
113 views
0 likes
|
Last Post
by seqadmin
02-05-2025, 10:34 AM
|
||
Started by seqadmin, 02-03-2025, 09:07 AM
|
0 responses
90 views
0 likes
|
Last Post
by seqadmin
02-03-2025, 09:07 AM
|
||
Started by seqadmin, 01-31-2025, 08:31 AM
|
0 responses
49 views
0 likes
|
Last Post
by seqadmin
01-31-2025, 08:31 AM
|
Leave a comment: