  • Magas8282
    Junior Member
    • Nov 2012
    • 2

    UnifiedGenotyper vs. Varscan vs. FreeBayes for Bacterial spp. Variant Calling

    Hello all,

    After searching the forum for some time trying to find an answer to my question, I decided to reach out to the community for some additional assistance. Currently, my group is working on SNP detection in bacterial species for phylogenetic analyses. I am fairly new to bioinformatics but am luckily working on these data within a local implementation of Galaxy.

    At this point, I am trying to compare SNP calls between UnifiedGenotyper, Mpileup/VarScan, and FreeBayes, all of which seem to be popular algorithms/programs. For our analysis, we are using short-read paired-end Illumina data (2x100), usually with 25-30 samples.

    My first question is which algorithm/program is preferred for bacterial species, or whether there are any best practices for these analyses. Second, and most importantly, I am having trouble finding criteria or guidelines for SNP filtering. The GATK site and essentially everywhere else I looked say that this process is highly subjective. However, I need a starting place, or some criteria on which to base my subjective decisions. For example, should I base the decision strictly on the QUAL value, or on a number of values including the genotype quality, read depth, and mapping quality? My ultimate goal is to reduce false-positive SNP calls (which I assume would have to do with some probability) as well as SNPs in areas with low coverage.

    Any assistance would be greatly appreciated.
  • swbarnes2
    Senior Member
    • May 2008
    • 910

    #2
    If you are looking for variants in a single clone and your coverage is adequate (say, at least half a million reads per Mb of genome), it doesn't matter much which program you use; those SNPs will be so strong that any software will be able to pick them out.
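To put the "half a million reads per Mb" rule of thumb in terms of depth, here is a rough back-of-the-envelope sketch. It assumes the 100 bp read length from the original post, counts each read individually (not each pair), and ignores unmapped or duplicate reads; the function name `mean_coverage` is just an illustrative helper.

```python
def mean_coverage(n_reads, read_length_bp, genome_size_bp):
    """Approximate mean depth: total sequenced bases divided by genome size."""
    return n_reads * read_length_bp / genome_size_bp


# Assumption: 500,000 individual 100 bp reads per 1 Mb of genome.
depth = mean_coverage(n_reads=500_000, read_length_bp=100, genome_size_bp=1_000_000)
print(depth)  # 50.0
```

Under those assumptions the threshold works out to roughly 50x. If "reads" is taken to mean read pairs, the figure would double.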

    If you were doing, say, a plant genome, which is complex and repetitive, with varying coverage and much of it covered at < 10x, then it would matter.

    Every genome and program has its own quirks. Pick one program, whichever looks the best documented or is simplest to use, and stick with it. Use IGV to eyeball the SNPs that the software calls, and maybe do some Sanger sequencing to confirm. All in all, it is better to use slightly inferior software that is well known in the community (meaning you can get help with it), and whose workings and limitations you understand reasonably well, than to use technically better software that neither you nor anyone else understands well.
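As a starting point for the multi-field filtering asked about in the original post, a minimal sketch might combine QUAL, read depth, and mapping quality. The thresholds below are placeholders, not recommendations from this thread, and the code assumes the caller writes `DP` and `MQ` tags in the INFO column; which tags are present varies by tool, so check your own VCF header. The helper names `parse_info` and `passes_filters` and the two example records are made up for illustration.

```python
def parse_info(info_field):
    """Turn a VCF INFO string such as 'DP=48;MQ=59.7' into a dict."""
    info = {}
    for item in info_field.split(";"):
        if "=" in item:
            key, value = item.split("=", 1)
            info[key] = value
        else:
            info[item] = True  # flag-style entry with no value
    return info


def passes_filters(vcf_line, min_qual=30.0, min_depth=10, min_mapq=30.0):
    """Return True if a VCF data line clears all three placeholder thresholds.

    Records with a missing or unparsable QUAL, DP, or MQ are rejected.
    """
    cols = vcf_line.rstrip("\n").split("\t")
    info = parse_info(cols[7])  # column 8 of a VCF record is INFO
    try:
        qual = float(cols[5])   # column 6 is QUAL
        depth = int(info["DP"])
        mapq = float(info["MQ"])
    except (KeyError, ValueError):
        return False
    return qual >= min_qual and depth >= min_depth and mapq >= min_mapq


# Made-up example records (CHROM, POS, ID, REF, ALT, QUAL, FILTER, INFO).
good = "chr1\t1042\t.\tA\tG\t812.5\t.\tDP=48;MQ=59.7"
low_depth = "chr1\t2210\t.\tC\tT\t95.0\t.\tDP=4;MQ=60.0"
print(passes_filters(good))       # True
print(passes_filters(low_depth))  # False
```

Whatever cutoffs you settle on, spot-checking the kept and rejected calls in IGV, as suggested above, is a reasonable way to tune them.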
