Seqanswers Leaderboard Ad

Collapse
X
 
  • Filter
  • Time
  • Show
Clear All
new posts
  • jkozubek
    Member
    • Mar 2011
    • 18

    Allele Distributions

    Hello:

    I found 320 mutations among 16 mouse clones using GATK. However, I noticed something strange. In only 40 of those 0/1 calls are there more reads with the ALT call (1) than reads with the REF call (0). In the vast remainder, 280 mutations I found, the reads that carry the ALT calls are a smaller fraction about 10-40 percent.

    If all my 320 mutations were real, I would expect the REF/ALT distributions to be about equal. If it was true and normally distributed, I'd expect at least 50% of these mutaitons to show up with more ALT calls and 50% to show up with more REF calls. Is it natural to find fewer reads with the ALT call in genuine mutations?

    I was thinking about using the bionomial distribution in Excel to remove these, something like -BINOMDIST(20,100,.50,FALSE) where I found 20 ALT calls with a Depth of 100 and expected to see 50% ALT calls.

    Jim
  • vivek_
    PhD Student
    • Jul 2012
    • 164

    #2
    I can't point to a proper source right away but I remember reading in a publication that the alternate allele fraction for heterozygous calls could be anywhere between 20 - 80% at variation sites.

    Comment

    • jkozubek
      Member
      • Mar 2011
      • 18

      #3
      I think so too. Still it seems odd that the distribution is lopsided so that many more times its the ALT that is seen in only 20-30% in the reads.

      Comment

      • swbarnes2
        Senior Member
        • May 2008
        • 910

        #4
        Keep in mind that there is some bias towards the reference allele. If your aligner only accepts 3 discrepancies between read and reference, reads matching the reference allele with 1,2 or 3 mistakes will align there, but only reads with 1 or 2 errors will align if they also have the alternate allele.

        You could try aligning to a reference that contains both alleles, see if that fixes things.

        Comment

        Latest Articles

        Collapse

        • seqadmin
          Pathogen Surveillance with Advanced Genomic Tools
          by seqadmin




          The COVID-19 pandemic highlighted the need for proactive pathogen surveillance systems. As ongoing threats like avian influenza and newly emerging infections continue to pose risks, researchers are working to improve how quickly and accurately pathogens can be identified and tracked. In a recent SEQanswers webinar, two experts discussed how next-generation sequencing (NGS) and machine learning are shaping efforts to monitor viral variation and trace the origins of infectious...
          03-24-2025, 11:48 AM
        • seqadmin
          New Genomics Tools and Methods Shared at AGBT 2025
          by seqadmin


          This year’s Advances in Genome Biology and Technology (AGBT) General Meeting commemorated the 25th anniversary of the event at its original venue on Marco Island, Florida. While this year’s event didn’t include high-profile musical performances, the industry announcements and cutting-edge research still drew the attention of leading scientists.

          The Headliner
          The biggest announcement was Roche stepping back into the sequencing platform market. In the years since...
          03-03-2025, 01:39 PM

        ad_right_rmr

        Collapse

        News

        Collapse

        Topics Statistics Last Post
        Started by seqadmin, 03-20-2025, 05:03 AM
        0 responses
        49 views
        0 reactions
        Last Post seqadmin  
        Started by seqadmin, 03-19-2025, 07:27 AM
        0 responses
        57 views
        0 reactions
        Last Post seqadmin  
        Started by seqadmin, 03-18-2025, 12:50 PM
        0 responses
        50 views
        0 reactions
        Last Post seqadmin  
        Started by seqadmin, 03-03-2025, 01:15 PM
        0 responses
        201 views
        0 reactions
        Last Post seqadmin  
        Working...