Unconfigured Ad

Collapse
X
 
  • Filter
  • Time
  • Show
Clear All
new posts
  • caballien
    Junior Member
    • Nov 2012
    • 7

    cufflinks stuck at a locus?

    When attempting to run cufflinks v2.0.2 (command line below) it quickly processes for moments and then gets stuck processing locus chr1:16765605-16765782. I've read on other threads to wait it out, but the job has been running for over 51hrs on 8 processors. It doesn't matter if I use the -g option or not or the -r option with a .gtf for all known rRNAs and tRNAs (downloaded from rmsk on UCSC table browser filtering for rRNA and tRNAs and then cat the files together). The reads were aligned with tophat2 without problems (~90% aligned or 50,000,000 paired-end reads from ribosomal depletion). Any suggestions? My only thought is to use the -r option with all of rmsk, but I'm not sure if that will help.

    Code:
    cufflinks -p 8 -M rrnatrnacatandsort.gtf -o C31_cufout accepted_hits.bam
    or
    Code:
    cufflinks -p 8 -M rrnatrnacatandsort.gtf -g ucscknowngenes.gtf -o C31_cufout accepted_hits.bam
    or
    Code:
    cufflinks -p 8 -g ucscknowngenes.gtf -o C31_cufout accepted_hits.bam
    or
    Code:
    cufflinks -p 8 -o C31_cufout accepted_hits.bam
  • g781
    Member
    • May 2010
    • 25

    #2
    Hi Caballien,

    Has you found out what's going on about cufflinks got stuck at a locus?

    I've got a similar problem too at a stage of doing "Inspecting reads and determining fragment length distribution". I have no idea what's happening with this problem. I've used a very trick way to run my data as well, but it doesn't mean I've addressed this problem.

    Code:
    #cufflinks -p 8 -M mask.gff -o ./data.th.cl ./data.th/accepted_hits.bam
    Here is how I ran cufflinks with the same dataset on 3 computers? The mask.gff is a file to exclude some genes I don't need.

    Computer-1:
    HP G7 server with CentOS6 and 28GB RAM. It got stuck at "Processing Locus Tb427_01_v4:1064380-1064569" .
    The same problem occurred when running with cufflinks complied from source code.

    Computer-2:
    DELL PC with CentOS 5.7 and 8GB RAM. It got stuck at "Processing Locus Tb427_01_v4:1064380-1064569".

    PS: 1 )Tb427 is T.brucei species, 01 is chr1 and v4 is just a version. The length of chr1 is 1064569.
    2) It works when the maks.gff includes all of items expect CDS and exon, but it doesn't make sense.

    Computer-3:
    Rocks cluster server with CentOS 5.6 and 256GB RAM. It was running as well but was over my head to figure out why it works.

    ---

    What I found among these three computers are that 1) Computer-3 makes great use of virtual memory and stack data when tracking from "top" command line, 2) the other two computers get stuck at processing loci when virtual memory and stack data reach about 3GB even if it's running whole the day, 3) There has a lot of physical memory left for both computer-1/2, and 4) All of three computers have more then 10GB swap space.

    Does anyone has a idea to explain this case or does anyone think that cufflinks allocating data to shuffle among swap ,stack and physical memory has something wrong?

    Many thanks in advance.

    Comment

    • g781
      Member
      • May 2010
      • 25

      #3
      For my test to Cufflinks in these couple of weeks, to ignore reads annotated like a rRNA, tRNA and so forth will solve problem of getting stuck at "Processing Locus ... ... ...". It has mentioned in the Cufflinks website.

      Hopefully, the information will help people meet the same problem in the future.

      Comment

      Latest Articles

      Collapse

      • SEQadmin2
        Nine Things a Sample Prep Scientist Thinks About Before Sequencing
        by SEQadmin2


        I’m not a sequencing expert. I’m a purification scientist who uses NGS to evaluate workflows my group develops. With this perspective, we think about the sample first and the NGS workflow second. The sequencer is an exceptionally honest reporter, but it can only report on what you give it, so whether you get clean, interpretable data from an NGS workflow is largely determined before you begin.


        Here are nine questions we think about, in roughly the order they matter, before...
        06-18-2026, 07:11 AM
      • SEQadmin2
        From Collection to Sequencing: Why Sample Preparation and Preservation Define Sequencing Data
        by SEQadmin2


        Data variability is still an issue in sequencing technologies despite the advances in reproducibility and accuracy of these platforms. But the problem does not originate in the sequencing itself, but in the previous steps, before the sample reaches the sequencer.


        The first step is collection, followed by preservation and sample preparation for analysis. Most scientists overlook those steps, but not being careful might just be skewing the experiment’s results.
        ...
        06-02-2026, 10:05 AM

      ad_right_rmr

      Collapse

      News

      Collapse

      Topics Statistics Last Post
      Started by SEQadmin2, 06-17-2026, 06:09 AM
      0 responses
      30 views
      0 reactions
      Last Post SEQadmin2  
      Started by SEQadmin2, 06-09-2026, 11:58 AM
      0 responses
      44 views
      0 reactions
      Last Post SEQadmin2  
      Started by SEQadmin2, 06-05-2026, 10:09 AM
      0 responses
      50 views
      0 reactions
      Last Post SEQadmin2  
      Started by SEQadmin2, 06-04-2026, 08:59 AM
      0 responses
      51 views
      0 reactions
      Last Post SEQadmin2  
      Working...