  • oscarluoinau
    Junior Member
    • Nov 2011
    • 6

    Cufflinks memory usage

    Hi all,
    I am having trouble running Cufflinks on paired-end (PE) RNA-Seq libraries generated on a HiSeq machine. I used TopHat to successfully map those PE reads (about 160 million reads), which gave me a BAM file of about 6 GB for each library. I then fed the BAM file to Cufflinks running on 20 cores. The problem is that Cufflinks seems to be using more than 120 GB of RAM and taking a very long time (about a week) to process one library. Have any of you had a similar experience? Am I doing something wrong? Any suggestions? Thanks!
  • Nicolas
    Member
    • Apr 2009
    • 41

    #2
    Which reference annotations are you using? I had a similar experience with the GENCODE annotations (>2.5 million annotations). I then switched to the RefSeq annotations, and the Cufflinks runs are now much shorter. Of course, it quantifies far fewer potential transcripts, but for most applications that can be sufficient.
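One quick way to compare the size of two annotation files before choosing between them is to count the records per feature type. The following is only a minimal sketch: the function name `count_gtf_features`, the file name in the comment, and the example records are made up for illustration, and it assumes a standard nine-column, tab-separated GTF.

```python
from collections import Counter
from typing import Iterable


def count_gtf_features(lines: Iterable[str]) -> Counter:
    """Count GTF records by feature type (column 3), skipping comments and blanks."""
    counts: Counter = Counter()
    for line in lines:
        if not line.strip() or line.startswith("#"):
            continue
        fields = line.rstrip("\n").split("\t")
        # A well-formed GTF record has nine tab-separated columns.
        if len(fields) >= 9:
            counts[fields[2]] += 1
    return counts


# Made-up records for illustration. For a real file, pass an open handle instead:
#   with open("annotation.gtf") as handle:
#       print(count_gtf_features(handle))
example = [
    "##description: made-up records for illustration\n",
    'chr1\tdemo\ttranscript\t100\t900\t.\t+\t.\tgene_id "g1"; transcript_id "t1";\n',
    'chr1\tdemo\texon\t100\t300\t.\t+\t.\tgene_id "g1"; transcript_id "t1";\n',
    'chr1\tdemo\texon\t600\t900\t.\t+\t.\tgene_id "g1"; transcript_id "t1";\n',
]
print(count_gtf_features(example))
```

Running this on each candidate annotation gives a rough sense of how many transcripts and exons Cufflinks would have to consider; it says nothing by itself about the resulting memory use.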


    • jbrwn
      Member
      • Mar 2011
      • 37

      #3
      In addition to posting your reference, you may want to post which options you're using in Cufflinks.


      • oscarluoinau
        Junior Member
        • Nov 2011
        • 6

        #4
        Right. I am also using the GENCODE annotation. I think I will experiment with another annotation file to see how it goes. The only option I am setting in Cufflinks is the library type for the PE reads (i.e. --library-type fr-unstranded).
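For anyone wanting to post their exact options, here is a minimal sketch of how such a Cufflinks command line could be assembled. The helper `build_cufflinks_cmd` and all file names are hypothetical. The flags shown (`-p`, `-o`, `-G`, `--library-type`, `--max-bundle-frags`) are documented Cufflinks options, but whether the annotation was passed with `-G` in this thread, and whether capping bundle size would help here, are assumptions rather than anything established above.

```python
import shlex
from typing import List, Optional


def build_cufflinks_cmd(
    bam: str,
    out_dir: str,
    threads: int = 8,
    gtf: Optional[str] = None,
    library_type: str = "fr-unstranded",
    max_bundle_frags: Optional[int] = None,
) -> List[str]:
    """Assemble (but do not run) a Cufflinks command as an argument list."""
    cmd = ["cufflinks", "-p", str(threads), "-o", out_dir,
           "--library-type", library_type]
    if gtf is not None:
        # -G: quantify against the given reference annotation only.
        cmd += ["-G", gtf]
    if max_bundle_frags is not None:
        # Loci with more fragments than this are skipped by Cufflinks.
        cmd += ["--max-bundle-frags", str(max_bundle_frags)]
    cmd.append(bam)
    return cmd


# Hypothetical paths; 20 threads as in the original post.
cmd = build_cufflinks_cmd("accepted_hits.bam", "cufflinks_out",
                          threads=20, gtf="annotation.gtf")
print(shlex.join(cmd))
```

The list form can be handed to `subprocess.run(cmd, check=True)` once the paths point at real files; printing it first makes it easy to paste the exact invocation into a thread like this one.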


        • kjlee
          Member
          • Jun 2011
          • 12

          #5
          Oscar,

          Did you ever get an answer or a workaround? I am running Cufflinks on a similar-sized BAM file and am also running out of memory.

          cheers,


          • oscarluoinau
            Junior Member
            • Nov 2011
            • 6

            #6
            I couldn't get the job done until I upgraded to the latest version of Cufflinks, which seems to use less memory. Good luck!


            • kjlee
              Member
              • Jun 2011
              • 12

              #7
              Hey Oscar,

              I am running the newest version of Cufflinks (I literally downloaded it this week). I got Cufflinks to run on a small subset of reads (about 4 million paired-end reads, 100 nt with a ~200 nt inner gap), but the whole dataset is ~50X bigger. How many reads did you use? And what (approximately) was the memory usage for your file size (RAM per GB of the BAM/SAM file)?

              Cheers,


              • oscarluoinau
                Junior Member
                • Nov 2011
                • 6

                #8
                Hi,
                I don't remember the exact numbers, as it was about a year ago. One thing I do remember is that I was lucky enough to have access to a machine with 1 TB of RAM, and I used about 500 GB for about 160 million reads. I hope this helps. Good luck!
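As a back-of-envelope check, the figures in this thread can be turned into a rough per-read estimate. This is only a sketch that assumes memory scales linearly with read count, which Cufflinks does not guarantee (annotation size and coverage depth at individual loci also matter). The function name is made up, and the 200 million figure is simply the ~4 million read subset from post #7 multiplied by ~50.

```python
def gb_per_million_reads(ram_gb: float, reads_millions: float) -> float:
    """Naive memory rate: GB of RAM per million reads."""
    return ram_gb / reads_millions


# Figures quoted in the thread: ~500 GB of RAM for ~160 million reads.
rate = gb_per_million_reads(500, 160)

# Post #7: ~4 million reads in the subset, full dataset ~50X bigger.
full_dataset_millions = 4 * 50
estimate_gb = rate * full_dataset_millions

print(f"{rate} GB per million reads")             # 3.125 GB per million reads
print(f"~{estimate_gb} GB for the full dataset")  # ~625.0 GB for the full dataset
```

Treat the result as an order-of-magnitude guide for requesting a node, not as a prediction.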


                • kjlee
                  Member
                  • Jun 2011
                  • 12

                  #9
                  We only have one compute node with that much memory on our cluster and I didn't want to usurp it if I didn't have to. But I guess that's what the resources are for. Thanks Oscar. (Also, about how long did it take to run?)

                  Cheers,


                  • oscarluoinau
                    Junior Member
                    • Nov 2011
                    • 6

                    #10
                    Expect it to run longer than a week.
