Seqanswers Leaderboard Ad

Collapse

Announcement

Collapse
No announcement yet.
X
 
  • Filter
  • Time
  • Show
Clear All
new posts

  • determining if an identified SNP is a sequence artifact

    Hi all,

    I'm have a question. The situation is that I was given exome-seq data for 120 tumor/normal matched pairs (240 samples all together) to very high depth (~400x). But I was only given the SNP calls and not raw data or mapped bam files so I can't go back to the sequence data itself to look at things a little more closely. The SNPs were called using mutect, for which I have little experience with and have been trying to read up on and get information about, but not really finding answers to my questions.

    So, in the outputted VCF files, in the filter field I have either PASS or mf1 (which is stated that it was filter out by mutect v1). But in looking at the data, I have a particular location, in an interesting gene that has been on our radar for awhile for various reasons. At this position, in ~80 of the samples, there is an identified SNP (always somatic, not in the matched normal). The coverage at the position is for the most part >=100x in both the tumor and normal and the alt allele in the tumor hovers around 10%. But, in all cases, the filter flag has an 'mf1', suggesting that mutect filtered it out. By all indication, this looks real to me, the mapping quality and other metrics look good. My question would be, could this really be a false positive site? If, this is a sequence artifact, how could so many samples have the same SNP? To me (and I am be wrong), but I would think that a sequence artifact would be random and not present at the same location in so many samples? Could this actually be a real site? Any thoughts?

  • #2
    An artifact would be MORE likely to be in all or most samples, while the chances of a real novel SNP occurring spontaneously in multiple unrelated tumors is virtually zero.

    Comment


    • #3
      Thanks. I guess I was thinking about it in the opposite way, in that if the same site was affected in many samples then it would be real. The odds of the same site being hit many times over be a rare event.

      Comment

      Latest Articles

      Collapse

      • seqadmin
        Quality Control Essentials for Next-Generation Sequencing Workflows
        by seqadmin




        Like all molecular biology applications, next-generation sequencing (NGS) workflows require diligent quality control (QC) measures to ensure accurate and reproducible results. Proper QC begins at nucleic acid extraction and continues all the way through to data analysis. This article outlines the key QC steps in an NGS workflow, along with the commonly used tools and techniques.

        Nucleic Acid Quality Control
        Preparing for NGS starts with isolating the...
        02-10-2025, 01:58 PM
      • seqadmin
        An Introduction to the Technologies Transforming Precision Medicine
        by seqadmin


        In recent years, precision medicine has become a major focus for researchers and healthcare professionals. This approach offers personalized treatment and wellness plans by utilizing insights from each person's unique biology and lifestyle to deliver more effective care. Its advancement relies on innovative technologies that enable a deeper understanding of individual variability. In a joint documentary with our colleagues at Biocompare, we examined the foundational principles of precision...
        01-27-2025, 07:46 AM

      ad_right_rmr

      Collapse

      News

      Collapse

      Topics Statistics Last Post
      Started by seqadmin, 02-07-2025, 09:30 AM
      0 responses
      71 views
      0 likes
      Last Post seqadmin  
      Started by seqadmin, 02-05-2025, 10:34 AM
      0 responses
      113 views
      0 likes
      Last Post seqadmin  
      Started by seqadmin, 02-03-2025, 09:07 AM
      0 responses
      86 views
      0 likes
      Last Post seqadmin  
      Started by seqadmin, 01-31-2025, 08:31 AM
      0 responses
      48 views
      0 likes
      Last Post seqadmin  
      Working...
      X