TopHat/Cufflinks without replicates

FoRsUs

Junior Member

Join Date: Jul 2011

Posts: 3
- Share
- Tweet
#1

TopHat/Cufflinks without replicates

02-05-2013, 02:36 AM

Hi everyone,

I want to share my pipeline to analyze RNAseq data without replicates because I'm not finding the expected results, and I ask for help in order to improve it.

My experimental design consist of a time-series experiment without replicates and 2 time-points: T0 and 6h hours of treatment using r5020 hormone (R6h).
So for this analysis I have 2 samples that I want to compare.

Furthermore I followed TopHat/cufflinks nature protocol paper and found less than expected differentially expressed genes at R6h compared to T0.
In our lab we know by previous microarray experiments that after 6h of treatment we could end up with almost 4000 regulated genes, but using RNAseq analysis I found more or less 600 regulated genes.

A good point of this is that genes found by RNAseq data seems to have a good correlation with microarray data, but I was expecting to find more genes.

After this results I search a bit on internet to see which kind of answers people publish related to this, but didn't find any clear answer.
I was wondering if the absence of replicates could affect dramatically the differential gene analysis, so I want to ask your opinion about this.

And also I paste here the bash code I use to run tophat/cufflinks to know if i'm doing it right (I hope so ).

Finally, Hope this question could serve anybody else doing RNAseq analysis without replicates

Code:

#!/bin/bash cuffdiff=/soft/bin/cuffdiff cufflinks=/soft/bin/cufflinks cuffmerge=/soft/bin/cuffmerge genes=~/daniel/tracks/human_genome_19/Homo_sapiens/UCSC/hg19/Annotation/Genes/genes.gtf genome=~/daniel/tracks/human_genome_19/Homo_sapiens/UCSC/hg19/Sequence/Bowtie2Index/genome genomeFA=~/daniel/tracks/human_genome_19/Homo_sapiens/UCSC/hg19/Sequence/WholeGenomeFasta/genome.fa mask=~/projects/rna-seq/large/analysis_4/mask.gtf tophat=/soft/bin/tophat transcriptome=~/daniel/RNAseq/New_analysis/Guille_samples/mapping-assembly/transcriptome_data/known ## T0 sample analysis $tophat -p 8 -G $genes --transcriptome-index=$transcriptome -o T0 --no-coverage-search -g 10 -r 98 $genome T0.read1 T0.read2 $cufflinks -p 8 -o T0 -M $mask --no-update-check T0/T0.accepted_hits.bam ## R6h sample analysis $tophat -p 8 -G $genes --transcriptome-index=$transcriptome -o R6h --no-coverage-search -g 10 -r 98 $genome R6h.read1 R6h.read2 $cufflinks -p 8 -o R6h -M $mask --no-update-check R6h/R6h.accepted_hits.bam ## Merge assemblies $cuffmerge -g $genes -s $genomeFA -p 8 assemblies.txt ## Start differential expression analysis $cuffdiff -o diff_out -b $genomeFA -p 8 -L T0,R6h -u merged_asm/merged.gtf T0/T0.accepted_hits.bam R6h/R6h.accepted_hits.bam -T -M $mask ##-------- CummeRbund R analysis ------- ##

Thanks everybody wish to contribute.

Last edited by FoRsUs; 02-05-2013, 02:42 AM.
Tags: cuffdiff, cufflinks, cuffmerge, rnaseq, tophat
dcchauss

Junior Member

Join Date: Feb 2013

Posts: 1
- Share
- Tweet
#2

02-06-2013, 01:26 PM

I believe the problem is that without replicates the estimated variance for each gene is the same for all genes, so you will only get very few statistically significant differentially expressed genes without replicates....there have been many posts on these forums about this..

Please see this thread: http://seqanswers.com/forums/showthr...cates+cuffdiff

Last edited by dcchauss; 02-07-2013, 11:20 AM.
Comment

Previous template Next

Reply to Nine Things a Sample Prep Scientist Thinks About Before Sequencing

by GATTACAT

Love this - good data definitely starts from good input, and poor input can only give relatively poor data. I particularly like the mention of Nanodrop/absorbance based methods for quantification. It's such a toss up if you'll get an accurate reading or what amounts to a randomly generated number, and a lot of library/sequencing related issues can be traced back to poor quant.
- Channel: Articles
07-01-2026, 11:43 AM
Nine Things a Sample Prep Scientist Thinks About Before Sequencing

by SEQadmin2

I’m not a sequencing expert. I’m a purification scientist who uses NGS to evaluate workflows my group develops. With this perspective, we think about the sample first and the NGS workflow second. The sequencer is an exceptionally honest reporter, but it can only report on what you give it, so whether you get clean, interpretable data from an NGS workflow is largely determined before you begin.

Here are nine questions we think about, in roughly the order they matter, before...
- Channel: Articles
06-18-2026, 07:11 AM

Topics	Statistics	Last Post
High-Resolution Sequencing Exposes Hidden Toxoplasma Diversity by SEQadmin2 Started by SEQadmin2, 07-02-2026, 11:08 AM	0 responses 7 views 0 reactions	Last Post by SEQadmin2 07-02-2026, 11:08 AM
New AI Model Captures Long-Range Genomic Signals to Improve RNA Splice Site Prediction by SEQadmin2 Started by SEQadmin2, 06-30-2026, 05:37 AM	0 responses 12 views 0 reactions	Last Post by SEQadmin2 06-30-2026, 05:37 AM
Large-Scale Protein Screen Uncovers Hidden Regulators of Alternative Polyadenylation by SEQadmin2 Started by SEQadmin2, 06-26-2026, 11:10 AM	0 responses 20 views 0 reactions	Last Post by SEQadmin2 06-26-2026, 11:10 AM
Whole-Genome Sequencing Traces Faroe Islands Ancestry to a North Atlantic Founder Population by SEQadmin2 Started by SEQadmin2, 06-17-2026, 06:09 AM	0 responses 54 views 0 reactions	Last Post by SEQadmin2 06-17-2026, 06:09 AM

Unconfigured Ad

TopHat/Cufflinks without replicates

Comment

Latest Articles

ad_right_rmr

News