I am attempting to detect indels from a panel of clones resulting from CRISPR targeted deletion. Regions around the target were PCR amplified to produce a roughly 160bp amplicon, which was then sequenced with as a PE150 run.
I've been banging my head against finding a tool that can:
1) Detect which clones have indels
2) Identify the location of these indels, ideally in a VCF or similar file such that the full panel of 96 clones can visualized (i.e. in IGV).
3) Provide annotation details about the read depth at the indel position and percentage of the sequences that contain an indel.
4) Is a stand-alone tool that I can install on my Unix box.
This question is very similar to one asked back in 2012, which was never sufficiently answered.
So far, I have tried a number of methods, each of which have some key failings.
CRISPR Genome Analyzer:
crispr-ga.net
Doesn't localize the indel detected. Only available as a web app; I need something that I can install locally and hammer on.
Outknocker:
Doesn't give localizations. Web based.
Pindel:
Pindel ends up missing a number of indels that are clearly there when viewing the reads from the BAM file. Also, pindel format is not good for looking across many clones, and the conversion pindel-to-vcf loses any depth and read-percentage information.
GATK HaplotypeCaller:
Misses a lot of very obvious indels. The caller appears to not be optimized for non-tiled non-fragmented reads.
GASVPRO and SVseq:
Simply don't detect any indels where they clearly exist. Again, the callers seem non-optimized for amplicon sequencing.
XHMM:
Refuses to do the PCA normalization with fewer targets than samples
CONTRA:
Doesn't deal well with small analysis windows for detecting indels shorter than 11bp.
CRISPR-GA uses a BLAT based alignment for their backend, but I can't find any additional information on how I would do further indel detection after a basic BLAT alignment. Any ideas on a BLAT based indel calling solution?
This seems like it should be a dead simple problem, something that was solved back in the 80's, but I can't find a good reference to a tool to use. Any help is appreciated.
Thanks
I've been banging my head against finding a tool that can:
1) Detect which clones have indels
2) Identify the location of these indels, ideally in a VCF or similar file such that the full panel of 96 clones can visualized (i.e. in IGV).
3) Provide annotation details about the read depth at the indel position and percentage of the sequences that contain an indel.
4) Is a stand-alone tool that I can install on my Unix box.
This question is very similar to one asked back in 2012, which was never sufficiently answered.
So far, I have tried a number of methods, each of which have some key failings.
CRISPR Genome Analyzer:
crispr-ga.net
Doesn't localize the indel detected. Only available as a web app; I need something that I can install locally and hammer on.
Outknocker:
Doesn't give localizations. Web based.
Pindel:
Pindel ends up missing a number of indels that are clearly there when viewing the reads from the BAM file. Also, pindel format is not good for looking across many clones, and the conversion pindel-to-vcf loses any depth and read-percentage information.
GATK HaplotypeCaller:
Misses a lot of very obvious indels. The caller appears to not be optimized for non-tiled non-fragmented reads.
GASVPRO and SVseq:
Simply don't detect any indels where they clearly exist. Again, the callers seem non-optimized for amplicon sequencing.
XHMM:
Refuses to do the PCA normalization with fewer targets than samples
CONTRA:
Doesn't deal well with small analysis windows for detecting indels shorter than 11bp.
CRISPR-GA uses a BLAT based alignment for their backend, but I can't find any additional information on how I would do further indel detection after a basic BLAT alignment. Any ideas on a BLAT based indel calling solution?
This seems like it should be a dead simple problem, something that was solved back in the 80's, but I can't find a good reference to a tool to use. Any help is appreciated.
Thanks
Comment