Hello,
I've performed an exome alignement (paired end reads) by using bfast match + localalign + postprocess, thereafter I've removed duplicates by Picard and when running the local realignement, during the GATK Indel Religner step I get the following error:
This is how the bam file looks:
So I guess the GATK it's right. My question is:
- I've runned the bfast postprocess with the '-a 3 -z' argument, so is it not supposed that it takes only one single alignement for each read?
- anyway, can I somehow say to the GATK to ignore these "conflictive" reads? I've tried with the '--validation_strictness SILENT' but it is still complaining.
Well, I'm pretty jammed with that, any help will be much appreciated. And merry christmas, by the way!
thanks,
david
I've performed an exome alignement (paired end reads) by using bfast match + localalign + postprocess, thereafter I've removed duplicates by Picard and when running the local realignement, during the GATK Indel Religner step I get the following error:
Code:
##### ERROR MESSAGE: Error caching SAM record HWUSI-EAS1692_0001:3:55:5381:15775#0, which is usually caused by malformed SAM/BAM files in which multiple identical copies of a read are present.
Code:
HWUSI-EAS1692_0001:3:55:5381:15775#0 179 chr1 148354187 0 95M = 148354236 49TAGCATCTTTCACAAAGCTCTCTGTGTTTGAGTACGCACCTTGATCCATAGGCTCACATTTGATCCCAACTGGCGGCTGCTTCTTGGCATTAACT DGFBGGGGGGGGGGGGGGGGGGGBGFFGGGFGDGEGGAGFEGDGGGGFEGEEGBGGGGGEGDBDEDEEDBA??EEA?################## XA:i:3 MD:Z:95 PG:Z:bfast RG:Z:012_t_l1 IH:i:1 NH:i:11 HI:i:1 NM:i:0 MQ:i:0 AS:i:4750 HWUSI-EAS1692_0001:3:55:5381:15775#0 81 chr1 148354236 0 95M = 148568822 214586 AGGCTCACATTTGATCCCAACTGGCGGCTGCTTCTTGGCATTAACTTTGGATTCCCAACCAGTAAATCTTACCAAGATCTGAGTTTCTCCAGGTA @AABAC<CA@>>>=4>=>=3DCDCFEDEGEECDF?DEFGFFEDCEDDDEEEDDGEGFGGGGGEGGGFGFGGDGGGGGGFGGFFGGGGGGGEGGGB XA:i:3 MD:Z:95 PG:Z:bfastRG:Z:012_t_l2 IH:i:1 NH:i:11 HI:i:1 NM:i:0 MQ:i:0 AS:i:4750 HWUSI-EAS1692_0001:3:55:5381:15775#0 115 chr1 148354236 0 95M = 148354187 -49AGGCTCACATTTGATCCCAACTGGCGGCTGCTTCTTGGCATTAACTTTGGATTCCCAACCAGTAAATCTTACCAAGATCTGAGTTTCTCCAGGTA @AABAC<CA@>>>=4>=>=3DCDCFEDEGEECDF?DEFGFFEDCEDDDEEEDDGEGFGGGGGEGGGFGFGGDGGGGGGFGGFFGGGGGGGEGGGB XA:i:3 MD:Z:95 PG:Z:bfast RG:Z:012_t_l1 IH:i:1 NH:i:11 HI:i:1 NM:i:0 MQ:i:0 AS:i:4750 HWUSI-EAS1692_0001:3:55:5381:15775#0 161 chr1 148568822 0 95M = 148354236 -214586 AGTTAATGCCAAGAAGCAGCCGCCAGTTGGGATCAAATGTGAGCCTATGGATCAAGGTGCGTACTCAAACACAGAGAGCTTTGTGAAAGATGCTA ##################?AEE??ABDEEDEDBDGEGGGGGBGEEGEFGGGGDGEFGAGGEGDGFGGGFFGBGGGGGGGGGGGGGGGGGGGBFGD XA:i:3 MD:Z:95 PG:Z:bfastRG:Z:012_t_l2 IH:i:1 NH:i:11 HI:i:1 NM:i:0 MQ:i:0 AS:i:4750
- I've runned the bfast postprocess with the '-a 3 -z' argument, so is it not supposed that it takes only one single alignement for each read?
- anyway, can I somehow say to the GATK to ignore these "conflictive" reads? I've tried with the '--validation_strictness SILENT' but it is still complaining.
Well, I'm pretty jammed with that, any help will be much appreciated. And merry christmas, by the way!
thanks,
david
Comment