Hello everybody, since everybody is getting on the variant calling from RNAseq bandwagon, my P.I wants to get in on the party as well. Too bad for overworked ppl like me
So I have this huge dataset of illumina paired end rnaseq which I am trying to run through GATK. I am taking the accepted_hits.bam file from tophat/cufflinks as the input file for GATK. My reference for the tophat/cufflinks pipeline was UCSC annotated genes (knownGenes.gtf) and UCSC hg19 reference.
My pipeline for gatk is:
convert bam>sam, sort sam> insert read groups> fixmates using picard> sam to bam> remove duplicates > reindex bam > realign indels
When i Run the indel realignment for GATK, I get the following error: contig chr 1 missing from reference. So i go ahead and look it up, and use the reorder sam option in picards tools. I modify my pipeline as follows:
convert bam>sam, sort sam> insert read groups> fixmates using picard> createdictionary.jar for hg 19 reference using picard > reorder sam >
sam to bam> remove duplicates > reindex bam > realign indels
My error is not solved even after reordering the sam file and i still get the same error of "chr1 contig not found in your reference"
Is it something to do with the references I have been using? I have used hg19 reference both for my tophat/cufflinks as well as the GATK pipeline.
Thanks a ton in advance!

My pipeline for gatk is:
convert bam>sam, sort sam> insert read groups> fixmates using picard> sam to bam> remove duplicates > reindex bam > realign indels
When i Run the indel realignment for GATK, I get the following error: contig chr 1 missing from reference. So i go ahead and look it up, and use the reorder sam option in picards tools. I modify my pipeline as follows:
convert bam>sam, sort sam> insert read groups> fixmates using picard> createdictionary.jar for hg 19 reference using picard > reorder sam >
sam to bam> remove duplicates > reindex bam > realign indels
My error is not solved even after reordering the sam file and i still get the same error of "chr1 contig not found in your reference"

Is it something to do with the references I have been using? I have used hg19 reference both for my tophat/cufflinks as well as the GATK pipeline.
Thanks a ton in advance!
Comment