We just released the Bayesian BSNP genotype caller we used to compare whole genome human sequences generated from differing technologies (454, Illumina, SOLiD, Sanger and Complete Genomics), and at differing mean depths. It requires a samtools pileup as input, but seems to compare favorably to maq and produces full joint distributions P(Geno & Data) so you can use both the posterior and the amount of data that supports that posterior, in downstream analysis. BSNP also offers full control over the genotype priors, and some sequencing technology specific error compensation (Illumina, Sanger, 454 and SOLiD). Finally, genotype calls are not directly dependent on the reference sequence, beyond any biases already embodied in the alignment.
The C++ source is available (no external dependencies) at http://compgen.bscb.cornell.edu/GPhoCS/BSNP/
Any feedback is welcome!
-- Brad
The C++ source is available (no external dependencies) at http://compgen.bscb.cornell.edu/GPhoCS/BSNP/
Any feedback is welcome!
-- Brad
Comment