Seqanswers Leaderboard Ad

Collapse

Announcement

Collapse
No announcement yet.
X
 
  • Filter
  • Time
  • Show
Clear All
new posts

  • How to estimate tumor purity?

    Typically, tumor samples contain normal cells to a certain extent. Some programs that deal with somatic mutations take tumor purity percentage as an input, e.g. ExomeCNV, VarScan, etc.

    I noticed that ExomeCNV's LOH calling algorithm supposedly can be used to estimate the % contaminated by normal cells. But it is not well documented in their User Guide. Can anyone show me how to do this?

    Or are there other programs that can estimate tumor purity?

    Thanks in advance.

  • #2
    Succinctly, you expect heterozygous somatic SNVs to be present in a sample at a frequency around 50% (and homozygous changes to be present at 100%). If your tumor is impure, these fractions will drop. If your sample is 10% normal cells, your numbers would be 45% and 90%. Regions with LOH or single copy number loss provide larger numbers of somatic mutations that should occur at nearly 100%, making this estimation easier.

    Comment


    • #3
      Is this what you are looking for?

      Comment


      • #4
        Originally posted by davidblaney View Post
        Is this what you are looking for?

        http://genomebiology.com/2013/14/7/R80/abstract
        looks good. Thanks a lot!

        Comment


        • #5
          This is the function in ExomeCNV to estimate fraction of normal cells in tumor sample. It takes two arguments. The first one is the logR (aka log intensity) value vector for different regions indexed from 1 to length(logR). The second argument is a vector of indexes of the logR vector that is determined to be LOH.

          ========================================
          guesstimate.contamination <-
          function(logR, region.idx=NULL) {
          if (is.null(region.idx)) region.idx = 1:length(logR)
          med.logR = median(logR[region.idx], na.rm=TRUE)
          if (med.logR < 0) { rho = 0.5 } else { rho = 1.5 }
          return((2**med.logR - rho)/(1 - rho))
          }
          =====================================

          From the look of it, it takes the median of the logR values in LOH regions and then apply this formula

          let logR_m be the logR median among the LOH regions

          if logR_m >=0, contamination = 3 - 2^(logR_m + 1)
          if logR_m < 0, contamination = 2^(logR_m + 1) - 1

          It seems to me it is theoretically possible to have contamination to be outside the bound of 0 and 1. Do you think this function is correct?

          Comment


          • #6
            According to formula described in their paper, the estimation formula is

            c = 1 - 2*average(|BAF_LOH - 0.5|)



            This formula seems to make more sense. It also won't make c goes out of bound.

            So which formula is correct???

            Comment

            Latest Articles

            Collapse

            • seqadmin
              New Genomics Tools and Methods Shared at AGBT 2025
              by seqadmin


              This year’s Advances in Genome Biology and Technology (AGBT) General Meeting commemorated the 25th anniversary of the event at its original venue on Marco Island, Florida. While this year’s event didn’t include high-profile musical performances, the industry announcements and cutting-edge research still drew the attention of leading scientists.

              The Headliner
              The biggest announcement was Roche stepping back into the sequencing platform market. In the years since...
              03-03-2025, 01:39 PM
            • seqadmin
              Investigating the Gut Microbiome Through Diet and Spatial Biology
              by seqadmin




              The human gut contains trillions of microorganisms that impact digestion, immune functions, and overall health1. Despite major breakthroughs, we’re only beginning to understand the full extent of the microbiome’s influence on health and disease. Advances in next-generation sequencing and spatial biology have opened new windows into this complex environment, yet many questions remain. This article highlights two recent studies exploring how diet influences microbial...
              02-24-2025, 06:31 AM

            ad_right_rmr

            Collapse

            News

            Collapse

            Topics Statistics Last Post
            Started by seqadmin, Today, 12:50 PM
            0 responses
            10 views
            0 likes
            Last Post seqadmin  
            Started by seqadmin, 03-03-2025, 01:15 PM
            0 responses
            181 views
            0 likes
            Last Post seqadmin  
            Started by seqadmin, 02-28-2025, 12:58 PM
            0 responses
            276 views
            0 likes
            Last Post seqadmin  
            Started by seqadmin, 02-24-2025, 02:48 PM
            0 responses
            663 views
            0 likes
            Last Post seqadmin  
            Working...
            X