Originally posted by Jeremy
View Post
Header Leaderboard Ad
Collapse
Biological replicates for RNA-seq
Collapse
Announcement
Collapse
No announcement yet.
X
-
Originally posted by eastasiasnow View Postyeah, pooling biological replicate samples will lose group variance. but could I use this design to do the following analysis? do people accept this design when I apply it in my paper? if so, what kind of tools can do this?
thank you very much.
Leave a comment:
-
Originally posted by Jeremy View PostBased on my quote marks I think I was asking the OP what they meant and then pointing out (via rhetorical question) that you can't get within group variance using a pooled approach. But it was so long ago I can't remember and the phrase that I quoted seems to no longer be there.
thank you very much.
Leave a comment:
-
Originally posted by eastasiasnow View Posthi Jeremy, have you got the answer of your concern?
Leave a comment:
-
Originally posted by Jeremy View PostBy "mix multiple biological replicates in the same barcoded sample", do you mean each sample has its own barcode and is pooled together and sequenced. Or do you mean that multiple samples are pooled together, given the same barcode and then sequenced?
If you mix samples and give them the same barcode, how do you calculate variance?
Leave a comment:
-
Cuffdiff - differential expression analysis between groups of samples
This criticism also applies to cuffdiff, at least to the version described in the paper. (There is a new version of cuffdiff that allows for biological replicates but there is no documentation on its method yet, and hence it is unclear whether it now asks the relevant question.)
Simon mentioned the existence of a new version of Cuffdiff that allows for biological replicates. Does anyone know anything else about this new version? Will it be released soon or is it already available somewhere?
Given the currently available Cuffdiff version (v0.9.3), is there any viable workaround to analyze groups of samples (e.g. control samples compared with treated samples)?
Thank you,
Alexandra
Leave a comment:
-
Originally posted by vpp605 View PostSimon - sorry for making you repeat everything over again -- if there is a "better" post for me to look at, please point me in that direction!
Here are a few of them:
Discussion of next-gen sequencing related bioinformatics: resources, algorithms, open source efforts, etc
Discussion of next-gen sequencing related bioinformatics: resources, algorithms, open source efforts, etc
Application of sequencing to RNA analysis (RNA-Seq, whole transcriptome, SAGE, expression analysis, novel organism mining, splice variants)
Application of sequencing to RNA analysis (RNA-Seq, whole transcriptome, SAGE, expression analysis, novel organism mining, splice variants)
Note that some of the mentioned software packages have got new functionality quite recently, i.e., some arguments in these threads about their limitations are out of date.
Simon
Leave a comment:
-
Thank you everyone for the replies!!
golharam - thank you for pointing me to that paper!
Simon - sorry for making you repeat everything over again -- if there is a "better" post for me to look at, please point me in that direction!
Leave a comment:
-
I've explained this in a number of posts before, so I just repeat the core points.
- If you use FIsher's exact test or something similar, you don't need any replicates because it cannot accommodate for them. The results, though, will be wrong, especially for strongly expressed genes.
This is because Fisher's exact test tests whether two samples differ in the concentration of a given transcript. This is, however, not the question you want to ask. What you want to know is whether the difference between two samples with different treatment is stronger than what you expect to see between two samples that are replicates, because otherwise, you cannot attribute the difference to the treatment.
This criticism also applies to cuffdiff, at least to the version described in the paper. (There is a new version of cuffdiff that allows for biological replicates but there is no documentation on its method yet, and hence it is unclear whether it now asks the relevant question.)
- If you have many replicates, use a t test.
- With only two or three replicates, you need to pool across genes, i.e., assume that similar genes have similar variance. Our DESeq package assumes that genes with similar expression strength have similar variance, and so pools information from these in order to get a reasonable estimate of biological variability, which is then used for the test.
Simon
Leave a comment:
-
We always recommend at least 3 biological replicates. If you do two, how do you know one isn't bad? If you do three, and one is bad, you can at least eliminate it and continue.
I think 4 would be ideal especially from a statistical standpoint, but that's not always possible because of cost. However, talk with your sequencing core facility to determine if barcoding multiple samples is an option. If it is, you may be able to sequence more samples for the same cost.
Take a look at this paper as well:
Statistical Design and Analysis of RNA Sequencing Data
Genetics, Vol. 185, No. 2. (1 June 2010), pp. 405-416.
Leave a comment:
-
By "mix multiple biological replicates in the same barcoded sample", do you mean each sample has its own barcode and is pooled together and sequenced. Or do you mean that multiple samples are pooled together, given the same barcode and then sequenced?
If you mix samples and give them the same barcode, how do you calculate variance?
Leave a comment:
-
mrawlins, can you suggest an article that explains the difference between these different statistical tests and why they require different numbers of biological replicates of RNA-seq data to be as powerful?
I've been reading the Cufflinks paper and trying to understand their statistical model used to analyze RNA-seq data, as written in the Supplementary Methods (Trapnell et al., 2010, in Nature Biotechnology). Can someone explain it to me in simple terms with as little math as possible?
Leave a comment:
-
like mrawlins, we've mixed multiple biological replicates in the same barcoded sample. but we're working on small rna seq.
It works great
Leave a comment:
-
This will depend on what statistics you use to determine statistical significance.
If you are using something like a T-test, you really want as many replicates as possible. You can't do these tests without at least 3 replicates. Depth isn't particularly useful here, but more replicates are. It's an oversimplified approach to the statistics, IMO.
If you use something like Fisher's Exact Test (hypergeometric or poisson distribution) then two biological replicates should be reasonable. You can actually mix multiple biological replicates in the same barcoded sample and it will likely give the same answer (since the reads from replicates are just added together). In this case the read depth is more useful than additional replicates. This method is what we use, but a number of people (probably more knowledgeable than me) have raised concerns with it.
If you use something like DESeq I don't have an answer for you, because I don't know anything about their statistical models. My guess is that two biological replicates would be fine for this type of analysis, though you may need 3.
I think this is a useful question, though, and I am interested to see what others think.
Leave a comment:
Latest Articles
Collapse
-
by seqadmin
At the intersection of cytogenetics and genomics lies the exciting field of cytogenomics. It focuses on studying chromosomes at a molecular scale, involving techniques that analyze either the whole genome or particular DNA sequences to examine variations in structure and behavior at the chromosomal or subchromosomal level. By integrating cytogenetic techniques with genomic analysis, researchers can effectively investigate chromosomal abnormalities related to diseases, particularly...-
Channel: Articles
Today, 06:26 AM -
-
by seqadmin
Cancer research has been transformed through numerous molecular techniques, with RNA sequencing (RNA-seq) playing a crucial role in understanding the complexity of the disease. Maša Ivin, Ph.D., Scientific Writer at Lexogen, and Yvonne Goepel Ph.D., Product Manager at Lexogen, remarked that “The high-throughput nature of RNA-seq allows for rapid profiling and deep exploration of the transcriptome.” They emphasized its indispensable role in cancer research, aiding in biomarker...-
Channel: Articles
09-07-2023, 11:15 PM -
-
by seqadmin
Ribonucleic acid (RNA) represents a range of diverse molecules that play a crucial role in many cellular processes. From serving as a protein template to regulating genes, the complex processes involving RNA make it a focal point of study for many scientists. This article will spotlight various methods scientists have developed to investigate different RNA subtypes and the broader transcriptome.
Whole Transcriptome RNA-seq
Whole transcriptome sequencing...-
Channel: Articles
08-31-2023, 11:07 AM -
ad_right_rmr
Collapse
News
Collapse
Topics | Statistics | Last Post | ||
---|---|---|---|---|
Started by seqadmin, Today, 07:53 AM
|
0 responses
7 views
0 likes
|
Last Post
by seqadmin
Today, 07:53 AM
|
||
Multiplexed Biomarker Detection with Nanopore Technology: A Leap in Precision Diagnostics
by seqadmin
Started by seqadmin, Yesterday, 07:42 AM
|
0 responses
11 views
0 likes
|
Last Post
by seqadmin
Yesterday, 07:42 AM
|
||
Started by seqadmin, 09-22-2023, 09:05 AM
|
0 responses
25 views
0 likes
|
Last Post
by seqadmin
09-22-2023, 09:05 AM
|
||
Started by seqadmin, 09-21-2023, 06:18 AM
|
0 responses
18 views
0 likes
|
Last Post
by seqadmin
09-21-2023, 06:18 AM
|
Leave a comment: