Unconfigured Ad

Collapse
X
 
  • Filter
  • Time
  • Show
Clear All
new posts
  • statsteam
    Member
    • Sep 2009
    • 19

    Cufflinks, differentially expressed genes

    Hi,

    I am trying to run edgeR or DEGseq using the output from cufflinks.
    I usually use mapped reads count as an input to edgeR or DEGseq. What cufflinks output do I need to use for an input to edgeR or DEGseq? I am thinking about adding "coverage" of each isoform for a gene from isoforms.fpkm_tracking file. Does this make sense?

    Thank you!
  • chadn737
    Senior Member
    • Jan 2009
    • 392

    #2
    edgeR and DEGseq take raw counts. They then do their own normalizations. Taking results from cufflinks and trying to use this in any of these programs is not a good approach, even though a lot of people try it for some reason. If you want to use the output of Cufflinks for differential expression, then I would stick to the Cufflinks pipeline and use Cuffdiff.

    Otherwise, extract read counts for each gene from your bam/sam/bed file and use this as input for edgeR/DEGseq.

    Comment

    • statsteam
      Member
      • Sep 2009
      • 19

      #3
      I agree that we'd better stick to cuffdiff for differentially expressed gene analysis. Doe cuffdiff have "paired"-analysis feature for the data with replicates? The paired-analysis feature is the main reason I want to use edgeR.

      Comment

      • Thomas Doktor
        Senior Member
        • Apr 2009
        • 105

        #4
        Cuffdiff supports replicates but does not handle paired replicates to my knowledge.

        Btw, I would recommend using DESeq instead of DEGseq, the spelling is similar but the internal statistical modelling is very different.
        Last edited by Thomas Doktor; 02-02-2012, 04:14 AM.

        Comment

        • dietmar13
          Senior Member
          • Mar 2010
          • 107

          #5
          how many replicates in each condition do you have?

          you could also use SAMseq (samr v2 R-package). this package works with many kinds of designs: paired, quantitative, right censored (like overall survival).
          in my hands, SAMseq produced most significant genes (followed by edgeR, baySeq, DESeq, NOIseq, and far far behind cuffdiff) , which were rather robust in bootstrap validations.

          my design: 12 normal vs 12 cancer (paired, means from the same patient).

          Comment

          • IBseq
            Member
            • Jul 2012
            • 56

            #6
            Hello guys,
            if anyone knows, could you please tell me why is this happening:

            i ran cufflinks on galaxy with default parameters and had satisfactory results.
            I then ran the same samples with same parameters except changing max intron length from 300000 to 600000

            in the second run have the exact number of transcripts but the FPKM values are much much lower..

            Any suggestion?

            Thanks,
            ibseq

            Comment

            Latest Articles

            Collapse

            • SEQadmin2
              From Collection to Sequencing: Why Sample Preparation and Preservation Define Sequencing Data
              by SEQadmin2


              Data variability is still an issue in sequencing technologies despite the advances in reproducibility and accuracy of these platforms. But the problem does not originate in the sequencing itself, but in the previous steps, before the sample reaches the sequencer.


              The first step is collection, followed by preservation and sample preparation for analysis. Most scientists overlook those steps, but not being careful might just be skewing the experiment’s results.
              ...
              06-02-2026, 10:05 AM
            • SEQadmin2
              Single-Cell Sequencing at an Inflection Point: Early Impacts of New Platforms and Emerging Trends
              by SEQadmin2


              With the launch of new single-cell sequencing platforms in 2026, the field stands at an exciting inflection point. This article surveys the most impactful advances in the field and discusses how they’re reshaping research in cancer, immunology, and beyond.


              Introduction

              Single-cell sequencing technologies have undergone remarkable advances over the past decade, transitioning from low-throughput experimental approaches to highly scalable platforms capable of...
              05-22-2026, 06:42 AM
            • SEQadmin2
              Environmental Genomics in the Age of NGS: From Microbes to Conservation Strategies
              by SEQadmin2

              Studying ecosystems means dealing with complex, multi-species communities that are hard to observe at scale. This complexity, however, hides many important questions to be answered, from how biogeochemical cycles work and how climate change can affect species distribution to how conservation strategies can work best.


              Genomics, particularly since the expansion of NGS, has transformed ecosystem ecology. By sequencing environmental DNA, we can now assess biodiversity without direct...
              05-06-2026, 09:04 AM

            ad_right_rmr

            Collapse

            News

            Collapse

            Topics Statistics Last Post
            Started by SEQadmin2, Today, 08:59 AM
            0 responses
            10 views
            0 reactions
            Last Post SEQadmin2  
            Started by SEQadmin2, 06-02-2026, 12:03 PM
            0 responses
            21 views
            0 reactions
            Last Post SEQadmin2  
            Started by SEQadmin2, 06-02-2026, 11:40 AM
            0 responses
            17 views
            0 reactions
            Last Post SEQadmin2  
            Started by SEQadmin2, 05-28-2026, 11:40 AM
            0 responses
            31 views
            0 reactions
            Last Post SEQadmin2  
            Working...