
  • cufflinks stuck at a locus?

    When I run cufflinks v2.0.2 (command lines below), it processes quickly for a few moments and then gets stuck at locus chr1:16765605-16765782. I've read on other threads that I should wait it out, but the job has been running for over 51 hours on 8 processors. It makes no difference whether I use the -g option, or the -r option with a .gtf of all known rRNAs and tRNAs (downloaded from rmsk in the UCSC table browser, filtered for rRNA and tRNA, and then concatenated with cat). The reads were aligned with tophat2 without problems (~90% aligned, or 50,000,000 paired-end reads, from a ribosomal-depletion library). Any suggestions? My only thought is to use the -r option with all of rmsk, but I'm not sure that will help.

    Code:
    cufflinks -p 8 -M rrnatrnacatandsort.gtf -o C31_cufout accepted_hits.bam
    or
    Code:
    cufflinks -p 8 -M rrnatrnacatandsort.gtf -g ucscknowngenes.gtf -o C31_cufout accepted_hits.bam
    or
    Code:
    cufflinks -p 8 -g ucscknowngenes.gtf -o C31_cufout accepted_hits.bam
    or
    Code:
    cufflinks -p 8 -o C31_cufout accepted_hits.bam
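
    For readers following along, here is a minimal sketch of how a combined mask file such as rrnatrnacatandsort.gtf could be assembled. The input names rRNA.gtf and tRNA.gtf and the function name build_mask are hypothetical, and the sort keys (chromosome, then start coordinate) are an assumption about what "cat and sort" meant in the post above.

```shell
# Sketch only: rRNA.gtf and tRNA.gtf are hypothetical names for the two
# table-browser exports; substitute your own files.
build_mask() {
    # Concatenate all inputs, drop comment and "track" header lines,
    # then sort by chromosome (column 1) and numeric start (column 4).
    cat "$@" | grep -v -e '^#' -e '^track' | LC_ALL=C sort -k1,1 -k4,4n
}

# Example call (commented out so the sketch has no side effects):
# build_mask rRNA.gtf tRNA.gtf > rrnatrnacatandsort.gtf
```

    The resulting file is what would be passed to -M in the commands above.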

  • #2
    Hi Caballien,

    Have you found out why cufflinks gets stuck at a locus?

    I've got a similar problem, which first appears at the "Inspecting reads and determining fragment length distribution" stage. I have no idea what is causing it. I've found a rather hacky way to get my data through, but that doesn't mean I've solved the problem.

    Code:
    #cufflinks -p 8 -M mask.gff -o ./data.th.cl ./data.th/accepted_hits.bam
    Here is how I ran cufflinks with the same dataset on 3 computers. The mask.gff is a file that excludes some genes I don't need.

    Computer-1:
    HP G7 server with CentOS 6 and 28GB RAM. It got stuck at "Processing Locus Tb427_01_v4:1064380-1064569".
    The same problem occurred when running cufflinks compiled from source code.

    Computer-2:
    DELL PC with CentOS 5.7 and 8GB RAM. It got stuck at "Processing Locus Tb427_01_v4:1064380-1064569".

    PS: 1) Tb427 is the T. brucei species, 01 is chr1 and v4 is just a version. The length of chr1 is 1064569.
    2) It works when mask.gff includes all items except CDS and exon, but that doesn't make sense.

    Computer-3:
    Rocks cluster server with CentOS 5.6 and 256GB RAM. It ran fine here, but figuring out why it works is beyond me.

    ---

    What I found across these three computers: 1) Computer-3 makes heavy use of virtual memory and stack, as seen when tracking the process with "top"; 2) the other two computers get stuck processing a locus once virtual memory and stack reach about 3GB, even if left running all day; 3) there is plenty of physical memory left on both Computer-1 and Computer-2; and 4) all three computers have more than 10GB of swap space.

    Does anyone have an idea that explains this, or does anyone think something is wrong with how cufflinks allocates and shuffles data among swap, stack and physical memory?

    Many thanks in advance.
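
    A ceiling near 3GB might (this is only a guess, nothing in this thread confirms it) come from a per-process limit or from a 32-bit binary, so it can be worth comparing these settings across the three machines. The sketch below prints the shell's virtual-memory and stack limits and, if a cufflinks executable is on the PATH, what kind of binary it is. The function name report_limits is made up for illustration.

```shell
# Sketch only: report_limits is a hypothetical helper name.
report_limits() {
    # Per-process limits of the current shell ("unlimited" means no cap).
    echo "virtual memory limit (KB): $(ulimit -v)"
    echo "stack size limit (KB): $(ulimit -s)"

    # Locate cufflinks, if installed, without failing when it is absent.
    bin=$(command -v cufflinks 2>/dev/null || true)
    if [ -n "$bin" ]; then
        # "file" reports whether the executable is 32-bit or 64-bit.
        file -L "$bin"
    else
        echo "cufflinks not found on PATH"
    fi
}

report_limits
```

    Running this on each computer and comparing the output would show whether the machines differ in limits or binary type. It does not by itself explain the stall.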



  • #3
    From my tests with Cufflinks over the past couple of weeks, ignoring reads annotated as rRNA, tRNA and so forth solves the problem of getting stuck at "Processing Locus ...". This is mentioned on the Cufflinks website.

    Hopefully this information will help people who run into the same problem in the future.
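
    As an illustration of the masking approach described above, the sketch below pulls rRNA and tRNA features out of an annotation file to use as a mask. It assumes a tab-separated GFF/GTF whose third column holds the feature type and uses the literal values "rRNA" and "tRNA". Both are assumptions about the annotation, and the function name make_mask is hypothetical.

```shell
# Sketch only: assumes column 3 of the annotation is the feature type
# and that rRNA/tRNA features are labelled exactly "rRNA" and "tRNA".
make_mask() {
    awk -F '\t' '$3 == "rRNA" || $3 == "tRNA"' "$1"
}

# Example use with the command from earlier in the thread (commented out;
# annotation.gff is a hypothetical input name):
# make_mask annotation.gff > mask.gff
# cufflinks -p 8 -M mask.gff -o ./data.th.cl ./data.th/accepted_hits.bam
```

    If the annotation labels these features differently, the awk condition would need adjusting.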
