Hi
I am having trouble with GATK ability to read my BAM files. THe BAM were created using tophat 2.0.0.4 and I used AddandReplaceReadGroups from Picard tools to do it. The code used was
java -Xmx1g -jar ~/programs/picard-tools-1.47/AddOrReplaceReadGroups.jar I=/home/sudeep/work/6-20-12/layers/all_infected_bams/1_4I_accepted_hits.bam O=/home/sudeep/work/6-20-12/layers/all_infected_bams/1_4I_accepted_hits_RG.bam SORT_ORDER=coordinate RGLB=Infected RGPL=illumina RGPU=HSWI72892 RGSM=1_4I.
I did use the VALIDATION_STRINGENCY=LENIENT, but to effect. I do index the BAM files. I even tried SortSAM to see if i had a problem. I looked at another thread posted here but nothing happened...
The GATK run code is below....
java -Xmx4g -jar GenomeAnalysisTK.jar -R chicken_order.fa --default_platform illumina --knownSites:variant,vcf ./trial_middle.vcf -I /home/sudeep/work/6-20-12/layers/all_infected_bams/1_4I_accepted_hits_RG_reorder.bam -T CountCovariates -cov ReadGroupcovariate -cov QualityScoreCovariate -cov CycleCovariate -cov DinucCovariate -recalFile /home/sudeep/work/6-20-12/layer/all_infected_bams/1_4I_recaldata.csv
INFO 02:04:00,711 HelpFormatter - ---------------------------------------------------------------------------------
INFO 02:04:00,714 HelpFormatter - The Genome Analysis Toolkit (GATK) v1.6-11-g3b2fab9, Compiled 2012/06/20 13:28:25
INFO 02:04:00,714 HelpFormatter - Copyright (c) 2010 The Broad Institute
INFO 02:04:00,714 HelpFormatter - Please view our documentation at http://www.broadinstitute.org/gsa/wiki
INFO 02:04:00,715 HelpFormatter - For support, please view our support site at http://getsatisfaction.com/gsa
INFO 02:04:00,715 HelpFormatter - Program Args: -R chicken_order.fa --default_platform illumina --knownSites:variant,vcf ./trial_middle.vcf -I /home/sudeep/work/6-20-12/layers/all_infected_bams/1_4I_accepted_hits_RG_reorder.bam -T CountCovariates -cov ReadGroupcovariate -cov QualityScoreCovariate -cov CycleCovariate -cov DinucCovariate -recalFile /home/sudeep/work/6-20-12/layer/all_infected_bams/1_4I_recaldata.csv
INFO 02:04:00,716 HelpFormatter - Date/Time: 2012/06/29 02:04:00
INFO 02:04:00,716 HelpFormatter - ---------------------------------------------------------------------------------
INFO 02:04:00,716 HelpFormatter - ---------------------------------------------------------------------------------
INFO 02:04:00,737 GenomeAnalysisEngine - Strictness is SILENT
INFO 02:04:00,822 SAMDataSource$SAMReaders - Initializing SAMRecords in serial
INFO 02:04:00,851 SAMDataSource$SAMReaders - Done initializing BAM readers: total time 0.03
INFO 02:04:00,867 RMDTrackBuilder - Loading Tribble index from disk for file ./trial_middle.vcf
INFO 02:04:01,774 CountCovariatesWalker - The covariates being used here:
INFO 02:04:01,774 CountCovariatesWalker - ReadGroupCovariate
INFO 02:04:01,774 CountCovariatesWalker - QualityScoreCovariate
INFO 02:04:01,775 CountCovariatesWalker - CycleCovariate
INFO 02:04:01,775 CountCovariatesWalker - DinucCovariate
INFO 02:04:01,854 TraversalEngine - [INITIALIZATION COMPLETE; TRAVERSAL STARTING]
INFO 02:04:01,855 TraversalEngine - Location processed.sites runtime per.1M.sites completed total.runtime remaining
INFO 02:04:03,192 GATKRunReport - Uploaded run statistics report to AWS S3
##### ERROR ------------------------------------------------------------------------------------------
##### ERROR A USER ERROR has occurred (version 1.6-11-g3b2fab9):
##### ERROR The invalid arguments or inputs must be corrected before the GATK can proceed
##### ERROR Please do not post this error to the GATK forum
##### ERROR
##### ERROR See the documentation (rerun with -h) for this tool to view allowable command-line arguments.
##### ERROR Visit our wiki for extensive documentation http://www.broadinstitute.org/gsa/wiki
##### ERROR Visit our forum to view answers to commonly asked questions http://getsatisfaction.com/gsa
##### ERROR
##### ERROR MESSAGE: SAM/BAM file SAMFileReader{/home/sudeep/work/6-20-12/layers/all_infected_bams/1_4I_accepted_hits_RG_reorder.bam} is malformed: BAM file has a read with mismatching number of bases and base qualities. Offender: HWI-ST913:105:C0EYJACXX:5:1304:11235:16705 [100 bases] [0 quals]
##### ERROR -------------------------------------
Please help. I could be something very simple
I am having trouble with GATK ability to read my BAM files. THe BAM were created using tophat 2.0.0.4 and I used AddandReplaceReadGroups from Picard tools to do it. The code used was
java -Xmx1g -jar ~/programs/picard-tools-1.47/AddOrReplaceReadGroups.jar I=/home/sudeep/work/6-20-12/layers/all_infected_bams/1_4I_accepted_hits.bam O=/home/sudeep/work/6-20-12/layers/all_infected_bams/1_4I_accepted_hits_RG.bam SORT_ORDER=coordinate RGLB=Infected RGPL=illumina RGPU=HSWI72892 RGSM=1_4I.
I did use the VALIDATION_STRINGENCY=LENIENT, but to effect. I do index the BAM files. I even tried SortSAM to see if i had a problem. I looked at another thread posted here but nothing happened...
The GATK run code is below....
java -Xmx4g -jar GenomeAnalysisTK.jar -R chicken_order.fa --default_platform illumina --knownSites:variant,vcf ./trial_middle.vcf -I /home/sudeep/work/6-20-12/layers/all_infected_bams/1_4I_accepted_hits_RG_reorder.bam -T CountCovariates -cov ReadGroupcovariate -cov QualityScoreCovariate -cov CycleCovariate -cov DinucCovariate -recalFile /home/sudeep/work/6-20-12/layer/all_infected_bams/1_4I_recaldata.csv
INFO 02:04:00,711 HelpFormatter - ---------------------------------------------------------------------------------
INFO 02:04:00,714 HelpFormatter - The Genome Analysis Toolkit (GATK) v1.6-11-g3b2fab9, Compiled 2012/06/20 13:28:25
INFO 02:04:00,714 HelpFormatter - Copyright (c) 2010 The Broad Institute
INFO 02:04:00,714 HelpFormatter - Please view our documentation at http://www.broadinstitute.org/gsa/wiki
INFO 02:04:00,715 HelpFormatter - For support, please view our support site at http://getsatisfaction.com/gsa
INFO 02:04:00,715 HelpFormatter - Program Args: -R chicken_order.fa --default_platform illumina --knownSites:variant,vcf ./trial_middle.vcf -I /home/sudeep/work/6-20-12/layers/all_infected_bams/1_4I_accepted_hits_RG_reorder.bam -T CountCovariates -cov ReadGroupcovariate -cov QualityScoreCovariate -cov CycleCovariate -cov DinucCovariate -recalFile /home/sudeep/work/6-20-12/layer/all_infected_bams/1_4I_recaldata.csv
INFO 02:04:00,716 HelpFormatter - Date/Time: 2012/06/29 02:04:00
INFO 02:04:00,716 HelpFormatter - ---------------------------------------------------------------------------------
INFO 02:04:00,716 HelpFormatter - ---------------------------------------------------------------------------------
INFO 02:04:00,737 GenomeAnalysisEngine - Strictness is SILENT
INFO 02:04:00,822 SAMDataSource$SAMReaders - Initializing SAMRecords in serial
INFO 02:04:00,851 SAMDataSource$SAMReaders - Done initializing BAM readers: total time 0.03
INFO 02:04:00,867 RMDTrackBuilder - Loading Tribble index from disk for file ./trial_middle.vcf
INFO 02:04:01,774 CountCovariatesWalker - The covariates being used here:
INFO 02:04:01,774 CountCovariatesWalker - ReadGroupCovariate
INFO 02:04:01,774 CountCovariatesWalker - QualityScoreCovariate
INFO 02:04:01,775 CountCovariatesWalker - CycleCovariate
INFO 02:04:01,775 CountCovariatesWalker - DinucCovariate
INFO 02:04:01,854 TraversalEngine - [INITIALIZATION COMPLETE; TRAVERSAL STARTING]
INFO 02:04:01,855 TraversalEngine - Location processed.sites runtime per.1M.sites completed total.runtime remaining
INFO 02:04:03,192 GATKRunReport - Uploaded run statistics report to AWS S3
##### ERROR ------------------------------------------------------------------------------------------
##### ERROR A USER ERROR has occurred (version 1.6-11-g3b2fab9):
##### ERROR The invalid arguments or inputs must be corrected before the GATK can proceed
##### ERROR Please do not post this error to the GATK forum
##### ERROR
##### ERROR See the documentation (rerun with -h) for this tool to view allowable command-line arguments.
##### ERROR Visit our wiki for extensive documentation http://www.broadinstitute.org/gsa/wiki
##### ERROR Visit our forum to view answers to commonly asked questions http://getsatisfaction.com/gsa
##### ERROR
##### ERROR MESSAGE: SAM/BAM file SAMFileReader{/home/sudeep/work/6-20-12/layers/all_infected_bams/1_4I_accepted_hits_RG_reorder.bam} is malformed: BAM file has a read with mismatching number of bases and base qualities. Offender: HWI-ST913:105:C0EYJACXX:5:1304:11235:16705 [100 bases] [0 quals]
##### ERROR -------------------------------------
Please help. I could be something very simple
Comment