First post as a user here, so please go easy on me for lack of due diligence
We have a genome assembled from Illumina data. There is a reference genome of a closely related species (same genus). I downloaded the proteins from this reference genome and sought to map them to our genome using local tblastn, as a homology-based annotation (we also have predicted transcripts from MAKER as an ab initio annotation method).
I have seen this method used in the literature, but all the method descriptions skip an important step - actually physically mapping the best tblastn hits (from whatever criteria) to the genome.
I assume there is some way to convert the blast xml output to an annotation file (GFF or similar) - one that conserves the info from the blast (especially protein name and function). I tried looking into Biopython and BioPerl but could not lay hands on the proper method of doing this.
Can someone please point me in the right direction?
We have a genome assembled from Illumina data. There is a reference genome of a closely related species (same genus). I downloaded the proteins from this reference genome and sought to map them to our genome using local tblastn, as a homology-based annotation (we also have predicted transcripts from MAKER as an ab initio annotation method).
I have seen this method used in the literature, but all the method descriptions skip an important step - actually physically mapping the best tblastn hits (from whatever criteria) to the genome.
I assume there is some way to convert the blast xml output to an annotation file (GFF or similar) - one that conserves the info from the blast (especially protein name and function). I tried looking into Biopython and BioPerl but could not lay hands on the proper method of doing this.
Can someone please point me in the right direction?
Comment