Unconfigured Ad

Collapse
X
 
  • Filter
  • Time
  • Show
Clear All
new posts
  • weiw
    Junior Member
    • Sep 2013
    • 2

    Cuffdiff FPKM question

    Hello,

    I am fairly new to the world of bioinformatics. I have data from an Illumina RNAseq run we did with Arabidopsis RNA. I ran the reads through the Cufflinks/Cuffdiff programs to get transcript levels for genes. I've noticed that for some genes, the program will group a few (2-4) genes/loci together and give them all the same FPKM. But if I search for these genes they are obviously separate genes and not isoforms of one gene. I believe it is because these genes are highly similar in sequence. But if I go and look at my aligned reads with a genome browser, many times most of the reads will only align to one of the genes. However in some instances both or all the genes show equal amounts of reads aligning. So my question is what exactly is the FPKM for these "grouped" genes? Is it for only one of the genes, or is it distributed amongst all the genes in the group? How do I find out what the FPKM for the individual genes are? Thanks in advance to anyone with any ideas!
  • nidhi.bioinfo
    Junior Member
    • Oct 2013
    • 1

    #2
    did you got any solution for your problem. I am facing the same issue.

    Comment

    • weiw
      Junior Member
      • Sep 2013
      • 2

      #3
      Hi,

      Yes I actually did. When I ran my Cuffdiff analysis, I used my merged.gtf file as the reference genome, not the original reference genome. My basic understanding is that the merged file takes into account your experimental data, as well the original reference genome, and creates a "merged" version of the two. So in the case of the multiple loci, the merged file decided based on my data that some reads span the entire region covering those two or more loci, thus the multiple loci with one FPKM. To fix this, I ran Cuffdiff again with the original reference genome, not the merged one. Hope that made sense...

      Comment

      Latest Articles

      Collapse

      • SEQadmin2
        Nine Things a Sample Prep Scientist Thinks About Before Sequencing
        by SEQadmin2


        I’m not a sequencing expert. I’m a purification scientist who uses NGS to evaluate workflows my group develops. With this perspective, we think about the sample first and the NGS workflow second. The sequencer is an exceptionally honest reporter, but it can only report on what you give it, so whether you get clean, interpretable data from an NGS workflow is largely determined before you begin.

        Here are nine questions we think about, in roughly the order they matter, before...
        06-18-2026, 07:11 AM
      • SEQadmin2
        From Collection to Sequencing: Why Sample Preparation and Preservation Define Sequencing Data
        by SEQadmin2


        Data variability is still an issue in sequencing technologies despite the advances in reproducibility and accuracy of these platforms. But the problem does not originate in the sequencing itself, but in the previous steps, before the sample reaches the sequencer.


        The first step is collection, followed by preservation and sample preparation for analysis. Most scientists overlook those steps, but not being careful might just be skewing the experiment’s results.
        ...
        06-02-2026, 10:05 AM

      ad_right_rmr

      Collapse

      News

      Collapse

      Topics Statistics Last Post
      Started by SEQadmin2, Yesterday, 11:10 AM
      0 responses
      7 views
      0 reactions
      Last Post SEQadmin2  
      Started by SEQadmin2, 06-17-2026, 06:09 AM
      0 responses
      42 views
      0 reactions
      Last Post SEQadmin2  
      Started by SEQadmin2, 06-09-2026, 11:58 AM
      0 responses
      104 views
      0 reactions
      Last Post SEQadmin2  
      Started by SEQadmin2, 06-05-2026, 10:09 AM
      0 responses
      125 views
      0 reactions
      Last Post SEQadmin2  
      Working...