Seqanswers Leaderboard Ad

Collapse

Announcement

Collapse
No announcement yet.
X
 
  • Filter
  • Time
  • Show
Clear All
new posts

  • Samtools SNP calling

    Hello there
    I have a issue with sam SNP calling. I work with captured genomic sequences.
    The fold coverage is very high at 600X. I used BWA (mismatch penalty -7) to map the reads to the genome and used samtools to call SNPs. I used mpileup and then realised that a known SNP was not called by mpileup and I tried to investigate what is happening in that region with pileup. The output is as follows
    CFA15 1299612 T T 255 0 59 309 g,GG,,.g,,.G.g.,,,gg.G,gg,g,,.G.,,,,,,GG...gGG,,,g.ggggg,.ggG.gg..g,Gg.Gggg.G
    .g,,.gg,,,,gg.,g,g.,,.Gg,.,gg,gg,ggggG,..Ggg,,g,g,,,.g,gG,gGgg.g,GG,Gg,,..,,g.G,,,,,,,.G,,gg,gG.gg,gGGGg.GGGG,g,gGg,G,,g,,g,.,g,g..GG.G.,gggg
    GggG.G,,g,g,.,,..,gGG..G,G,,..g,gg,g,.,Ggg,.G,g,.,gGGGGg,G.GGg,.gggG,g,,,g,G.G..G...,g^]g^],^],^]g^]G !T!!]^^!^^^!^!^^^^!!^!^!!^!^^^!^^^^^^
    ^!!^^^!!!^^^!^!!!!!^^!!!^!!^^!^!!^!!!!^!^!^^^!!^^^^!!^^!^!^^^^!!^^^!!^!!^!!!!!^^^!!!^^!^!^^^^!^!!^!!!!^!^!!^!!^^^^^^!^!^^^^^^^^!^^!!^!!^!!^!!
    !!!^!!!!^!^!!!^!^^!^^!^^^!^!^^!!^!^^!!!!!!!!^!^^!]!^^^^^^^!!!^^!^!^^^B!^!!^!^^^!!!^^!^!^^^!!!!,!^!^!!!^^!!!!^!^__!_!_!EE!EEA>!!EE!!
    What I do not understand is why is samtools not reporting the consensus sequence as K ? Is this the reason why it is not called as variant position ?
    Thanks a lot for the answers

  • #2
    All those !'s are the lowest quality. It doesn't want to call a G because it thinks all the G calls are horribly unreliable.

    Did you run mpileup with the default settings? At my fingertips, I've got a similar case, with 2 SNPs that definitely sanger confirmed, but also had lousy quality scores. When I re-ran mpileup with -B, the quality scores improved to what the .sam file said they ought to be, and my two SNPs popped up.

    If you ran pileup, you should really get the newest version of samtools, and run mpileup. People will be less willing to troubleshoot software they know is superseded.

    Comment


    • #3
      do you have format the original data?If not ,I also understand...

      Comment


      • #4
        Thank you very much. It worked.

        Comment

        Latest Articles

        Collapse

        • seqadmin
          Best Practices for Single-Cell Sequencing Analysis
          by seqadmin



          While isolating and preparing single cells for sequencing was historically the bottleneck, recent technological advancements have shifted the challenge to data analysis. This highlights the rapidly evolving nature of single-cell sequencing. The inherent complexity of single-cell analysis has intensified with the surge in data volume and the incorporation of diverse and more complex datasets. This article explores the challenges in analysis, examines common pitfalls, offers...
          Today, 07:15 AM
        • seqadmin
          Latest Developments in Precision Medicine
          by seqadmin



          Technological advances have led to drastic improvements in the field of precision medicine, enabling more personalized approaches to treatment. This article explores four leading groups that are overcoming many of the challenges of genomic profiling and precision medicine through their innovative platforms and technologies.

          Somatic Genomics
          “We have such a tremendous amount of genetic diversity that exists within each of us, and not just between us as individuals,”...
          05-24-2024, 01:16 PM

        ad_right_rmr

        Collapse

        News

        Collapse

        Topics Statistics Last Post
        Started by seqadmin, Today, 08:18 AM
        0 responses
        8 views
        0 likes
        Last Post seqadmin  
        Started by seqadmin, Today, 08:04 AM
        0 responses
        10 views
        0 likes
        Last Post seqadmin  
        Started by seqadmin, 06-03-2024, 06:55 AM
        0 responses
        13 views
        0 likes
        Last Post seqadmin  
        Started by seqadmin, 05-30-2024, 03:16 PM
        0 responses
        27 views
        0 likes
        Last Post seqadmin  
        Working...
        X