Seqanswers Leaderboard Ad

Collapse

Announcement

Collapse
No announcement yet.
X
 
  • Filter
  • Time
  • Show
Clear All
new posts

  • Blast+ database with gene annotation

    Hello,

    I have been trying for the last few days to make a blast database (with blast+ tools) that would contain the gene information from a genebank file. Is it possible to do that with blast+ or with bioperl?
    I have been looking at bioperl and it looks like you can add the annotation information to a sequence object, but I don't relly understand which object should I use. There is not a lot of information on this, not even in the bioperl docs.

    Thanks,
    Andrei

  • #2
    Andrei,

    BLAST databases aren't designed to contain a large amount of annotation about a sequence, and you can't create one directly from a GenBank file. The input to makeblastdb must be a FASTA formatted file (or an existing BLASTDB you want to alter). The text on the definition (description) line for each sequence is stored in the BLASTdb but just as a plain text string.

    The more reasonable approach would be to read the accession or gi numbers from the BLAST hits and then use those to query GenBank or a local database for annotation.

    Comment


    • #3
      OK, so blast+ does not do this automatically. Then I should be able to do this with bioperl. There should be an object that can get the genbank information from a file and then you could compare it to a SeqIO object that contains the corresponding sequence.

      What I want to do is compare a query sequence to a blast database and see which gene from that database corresponds to that query. Also print out the parcentage of the matching nucleotides.

      In order to do this I think I must first get the sequence for each gene that is found in the genebank file from the blast database. Then BLAST my query sequence against the gene sequences and get the one that matches the best.

      Does this make any sense to anyone else?

      Andrei

      Comment


      • #4
        Why not follow kmcarr's suggestion - first Blast, then lookup the accessions in a annotation file or db?

        That's how I'd do it too.

        Comment


        • #5
          And how do I do that? The computer that I am working from has no internet connection, so I have to download all the genbank files that contain the gene information. I can make a local database from them too, as you can make with the fasta sequences?
          Last edited by andreitudor; 03-03-2011, 08:32 AM.

          Comment

          Latest Articles

          Collapse

          • seqadmin
            Exploring the Dynamics of the Tumor Microenvironment
            by seqadmin




            The complexity of cancer is clearly demonstrated in the diverse ecosystem of the tumor microenvironment (TME). The TME is made up of numerous cell types and its development begins with the changes that happen during oncogenesis. “Genomic mutations, copy number changes, epigenetic alterations, and alternative gene expression occur to varying degrees within the affected tumor cells,” explained Andrea O’Hara, Ph.D., Strategic Technical Specialist at Azenta. “As...
            07-08-2024, 03:19 PM
          • seqadmin
            Exploring Human Diversity Through Large-Scale Omics
            by seqadmin


            In 2003, researchers from the Human Genome Project (HGP) announced the most comprehensive genome to date1. Although the genome wasn’t fully completed until nearly 20 years later2, numerous large-scale projects, such as the International HapMap Project and 1000 Genomes Project, continued the HGP's work, capturing extensive variation and genomic diversity within humans. Recently, newer initiatives have significantly increased in scale and expanded beyond genomics, offering a more detailed...
            06-25-2024, 06:43 AM

          ad_right_rmr

          Collapse

          News

          Collapse

          Topics Statistics Last Post
          Started by seqadmin, Yesterday, 05:49 AM
          0 responses
          15 views
          0 likes
          Last Post seqadmin  
          Started by seqadmin, 07-15-2024, 06:53 AM
          0 responses
          26 views
          0 likes
          Last Post seqadmin  
          Started by seqadmin, 07-10-2024, 07:30 AM
          0 responses
          37 views
          0 likes
          Last Post seqadmin  
          Started by seqadmin, 07-03-2024, 09:45 AM
          0 responses
          204 views
          0 likes
          Last Post seqadmin  
          Working...
          X