Unconfigured Ad

Collapse
X
 
  • Filter
  • Time
  • Show
Clear All
new posts
  • dlepe
    Junior Member
    • Jul 2014
    • 8

    sum of FPKMs?

    I'm analyzing the expression levels of certain genes in different tissues with data from a database and I need to count two different genes as one because I know by experimental data that they were erroneously annotated.

    The expression levels in the database are in FPKM and I know I can't simple make the sum of the two genes to count it as one.

    If I had raw counts what I would do is

    gene A = 400, gene B = 300, counting them as a single gene = 700.

    what would be the best thing to do this with FPKMs?

    gene A = 12 FPKM
    gene B = 20 FPKM
    as single gene = x ?
    Last edited by dlepe; 07-23-2014, 11:51 AM.
  • blakeoft
    Member
    • Oct 2013
    • 79

    #2
    FPKM is fragments per kilobase of transcript per million mapped reads.

    So then
    x = total number of fragments / ((total number of bases of transcipt / 1000) * (mapped fragments / 1000000))
    = (fragments mapped to gene A + fragments mapped to gene B) / ((bases of gene A + bases of gene B) / 1000 * (mapped fragments / 1000000))

    This would be if gene A and gene B did not overlap (and by that I mean that no read is mapped to both gene A and gene B). If they do, you'll have to use something like the inclusion-exclusion principle. I don't think you can simply add the two FPKM values, like you mentioned.

    Comment

    • dlepe
      Junior Member
      • Jul 2014
      • 8

      #3
      The thing is I don´t have the total number of mapped fragments from the libraries, I would have to try to see if the raw data is available somewhere and do the mapping myself..

      Since I'm trying to get an estimation of the correlated expression between the gene in question to another gene a friend suggested to simply use the average of gene A and gene B as the expression value I'm trying to find.

      His reasoning is that since FPKMs are normalized by length, and assuming that the number of raw counts in gene A and B similar, the FPKM for only gene A or B should be very similar to the number of FPKMs we'd get if we calculate the FPKMs for they both as a single gene.

      Comment

      • blakeoft
        Member
        • Oct 2013
        • 79

        #4
        I suppose you could do an average. I think a weighted average would be better suited for this. You could weight each FPKM value by the length of the corresponding gene.

        Comment

        • dlepe
          Junior Member
          • Jul 2014
          • 8

          #5
          Yeah I guess, I'll see how that goes, thanks.

          Comment

          • blakeoft
            Member
            • Oct 2013
            • 79

            #6
            I just did the math, and the weighted average is what you want, provided the genes don't overlap like I previously stated. So if gene A has FPKM a, and gene B has FPKM b, you want:

            a * |A| + b * |B|
            |A| + |B|

            where |x| is the length of gene x.

            Edit: If you want, I can type up my reasoning in latex. I just don't know of a nice way to display fractions on seqanswers.

            Comment

            • dlepe
              Junior Member
              • Jul 2014
              • 8

              #7
              awesome, I'll look into it, thanks again.

              Comment

              Latest Articles

              Collapse

              • SEQadmin2
                From Collection to Sequencing: Why Sample Preparation and Preservation Define Sequencing Data
                by SEQadmin2


                Data variability is still an issue in sequencing technologies despite the advances in reproducibility and accuracy of these platforms. But the problem does not originate in the sequencing itself, but in the previous steps, before the sample reaches the sequencer.


                The first step is collection, followed by preservation and sample preparation for analysis. Most scientists overlook those steps, but not being careful might just be skewing the experiment’s results.
                ...
                06-02-2026, 10:05 AM
              • SEQadmin2
                Single-Cell Sequencing at an Inflection Point: Early Impacts of New Platforms and Emerging Trends
                by SEQadmin2


                With the launch of new single-cell sequencing platforms in 2026, the field stands at an exciting inflection point. This article surveys the most impactful advances in the field and discusses how they’re reshaping research in cancer, immunology, and beyond.


                Introduction

                Single-cell sequencing technologies have undergone remarkable advances over the past decade, transitioning from low-throughput experimental approaches to highly scalable platforms capable of...
                05-22-2026, 06:42 AM
              • SEQadmin2
                Environmental Genomics in the Age of NGS: From Microbes to Conservation Strategies
                by SEQadmin2

                Studying ecosystems means dealing with complex, multi-species communities that are hard to observe at scale. This complexity, however, hides many important questions to be answered, from how biogeochemical cycles work and how climate change can affect species distribution to how conservation strategies can work best.


                Genomics, particularly since the expansion of NGS, has transformed ecosystem ecology. By sequencing environmental DNA, we can now assess biodiversity without direct...
                05-06-2026, 09:04 AM

              ad_right_rmr

              Collapse

              News

              Collapse

              Topics Statistics Last Post
              Started by SEQadmin2, Today, 08:59 AM
              0 responses
              7 views
              0 reactions
              Last Post SEQadmin2  
              Started by SEQadmin2, 06-02-2026, 12:03 PM
              0 responses
              21 views
              0 reactions
              Last Post SEQadmin2  
              Started by SEQadmin2, 06-02-2026, 11:40 AM
              0 responses
              14 views
              0 reactions
              Last Post SEQadmin2  
              Started by SEQadmin2, 05-28-2026, 11:40 AM
              0 responses
              29 views
              0 reactions
              Last Post SEQadmin2  
              Working...