Cuffmerge and Cuffquant questions - help please!

yehudithasin

Member

Join Date: Jan 2011

Posts: 14
#1


06-27-2014, 05:08 AM

Hello wise people

I need help. I am analyzing ~100 mouse RNA-seq samples with Cufflinks version 2.2.1. I aligned the reads with STAR, ran each sample through cufflinks, and merged the transcripts.gtf files using cuffmerge. The resulting GTF file is ~1 GB, with 41K genes/loci and ~380K transcripts.
Question 1:
Is that too many? Do you usually filter the transcripts in transcripts.gtf by coverage, FPKM and status before merging?
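A minimal sketch of the kind of pre-merge filtering asked about in Question 1, assuming each transcripts.gtf carries `FPKM` and `cov` attributes on its `transcript` rows. The function names and the cutoff values are hypothetical placeholders, not recommended settings, and the sketch does not handle status.

```python
import re

# Matches GTF attributes written as: transcript_id "CUFF.1.1"; FPKM "12.3";
ATTR_RE = re.compile(r'(\S+) "([^"]*)";')


def parse_attributes(field):
    """Turn the 9th GTF column into a dict of attribute name -> value."""
    return dict(ATTR_RE.findall(field))


def filter_gtf_lines(lines, min_fpkm=1.0, min_cov=0.0):
    """Keep all rows of transcripts that pass the FPKM and coverage cutoffs.

    min_fpkm and min_cov are illustrative defaults (assumption), not advice.
    """
    lines = list(lines)
    keep_ids = set()

    # Pass 1: decide per transcript, using only the "transcript" rows.
    for line in lines:
        cols = line.rstrip("\n").split("\t")
        if len(cols) < 9 or cols[2] != "transcript":
            continue
        attrs = parse_attributes(cols[8])
        tid = attrs.get("transcript_id")
        try:
            fpkm = float(attrs.get("FPKM", "nan"))
            cov = float(attrs.get("cov", "nan"))
        except ValueError:
            continue
        # Comparisons with NaN are False, so rows missing a value are dropped.
        if tid and fpkm >= min_fpkm and cov >= min_cov:
            keep_ids.add(tid)

    # Pass 2: keep every row (transcript and exon) of the selected transcripts.
    kept = []
    for line in lines:
        cols = line.rstrip("\n").split("\t")
        if len(cols) < 9:
            continue
        if parse_attributes(cols[8]).get("transcript_id") in keep_ids:
            kept.append(line)
    return kept
```

To apply it to a file, read the lines of one transcripts.gtf, pass them to `filter_gtf_lines`, and write the result to a new file that is then listed for cuffmerge. Deciding per transcript and then keeping its exon rows avoids leaving a transcript without exons.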

Now I am trying to run cuffquant on one sample (~40M reads) with 8 processors, 4 GB each, using the merged.gtf file as the reference. It seemed to work fine, but it has now been stuck for a few hours on
"> Processing Locus chrX:151168793-151474354 [************************ ] 99%".
Question 2:
Is it some parameter issue? What should I do?
command line for cuffquant:
cuffquant -p 8 -o TEST_1sample -u -b path2/genome.fa path2/merged.gtf path2/Sample1.bam

Any help would be much appreciated! Thank you in advance
Yu

------------------------------------------ an UPDATE ----------------------------------

In Russian they say "morning is wiser than evening", so I went to sleep and let my computer continue working. It is almost 10 AM now.
The good news is that cuffquant got past this particular locus (which means it was not stuck), but it is extremely slow. Specifically, I see the following:
[06:21:14] Learning bias parameters.
[07:14:21] Quantifying expression levels in locus.
> Processing Locus chr3:55586402-55587294 [*** ] 15%
It is 9:45 AM now!
If the 15% scales linearly with time, cuffquant will finish this step in about 16 hours (is it the last step?).
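The estimate above can be reproduced with a small linear extrapolation. This is only a sketch: it assumes the progress percentage advances evenly with time, which may not hold if some loci are much slower than others, and the function name is hypothetical.

```python
from datetime import datetime


def estimate_total(start, now, fraction_done):
    """Linearly extrapolate total and remaining time for one step."""
    if not 0 < fraction_done <= 1:
        raise ValueError("fraction_done must be in (0, 1]")
    elapsed = now - start
    total = elapsed / fraction_done  # timedelta divided by a float
    return total, total - elapsed


# Timestamps taken from the log above: the step started at 07:14:21
# and was at 15% at about 09:45.
start = datetime.strptime("07:14:21", "%H:%M:%S")
now = datetime.strptime("09:45:00", "%H:%M:%S")

total, remaining = estimate_total(start, now, 0.15)
print(f"estimated total:     {total.total_seconds() / 3600:.1f} h")
print(f"estimated remaining: {remaining.total_seconds() / 3600:.1f} h")
```

With these inputs the extrapolated total for the step is about 16.7 hours, of which roughly 14.2 hours would still remain at 09:45.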

So let me rephrase the question: is it normal for cuffquant to spend more than 12 hours processing one BAM file on 8 processors?
How much memory per processor does it actually need? Would it be faster if I ran it on 32 processors, but with 1 or 2 GB each?
I would really appreciate it if some of you could share your experience with cuffquant running times and requirements, how much it speeds up the later steps (the next step will be cuffnorm), and whether there is a better way to run it. It would be really nice if someone had benchmarking data on these.
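One way to collect such benchmarking data is to time the same command at several thread counts. The sketch below is an assumption-laden illustration: the helper names and the per-run output directory name are hypothetical, only wall-clock time is measured (not memory), and a trivial stand-in command is used so the example runs anywhere.

```python
import subprocess
import sys
import time


def time_command(argv):
    """Run a command and return (exit code, wall-clock seconds)."""
    t0 = time.perf_counter()
    proc = subprocess.run(argv)
    return proc.returncode, time.perf_counter() - t0


def compare_thread_counts(make_argv, thread_counts):
    """Time the command built by make_argv(n) for each thread count n."""
    return {n: time_command(make_argv(n)) for n in thread_counts}


def cuffquant_argv(n):
    """Build the cuffquant command from the post with -p set to n.

    The output directory suffix is a hypothetical naming choice so that
    runs do not overwrite each other.
    """
    return ["cuffquant", "-p", str(n), "-o", f"TEST_1sample_p{n}",
            "-u", "-b", "path2/genome.fa",
            "path2/merged.gtf", "path2/Sample1.bam"]


def demo_argv(n):
    """Stand-in command that ignores n; lets the sketch run without cuffquant."""
    return [sys.executable, "-c", "pass"]


for n, (code, seconds) in compare_thread_counts(demo_argv, [1, 2]).items():
    print(f"threads={n} exit={code} wall={seconds:.2f}s")
```

Passing `cuffquant_argv` instead of `demo_argv` would time the real command, ideally on a small subset of the data first, since each full run may take many hours.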

Best,
Yu
--------------------------------------------------------------------------------------------------------------------------

Last edited by yehudithasin; 06-27-2014, 09:08 AM.
