Seqanswers Leaderboard Ad

Collapse

Announcement

Collapse
No announcement yet.
X
 
  • Filter
  • Time
  • Show
Clear All
new posts

  • How does NCBI populate data fro gene entries? I want to get all refseq mRNA..

    Hi All,

    I'm just wondering if anyone can shed light on how to obtain the latest annotations of a given organism from NCBI, and more specifically how to get all of the current transcript variants that are listed as refseq's..

    I'm working on honeybees and I've grabbed the latest gff flatfile annotations from ftp://ftp.ncbi.nih.gov/genomes/Apis_mellifera/GFF/ but they don't contain all of the current refseq transcripts..

    For example, the gene cort (http://www.ncbi.nlm.nih.gov/gene/726912) has three transcript variants listed as refseq entries; XM_006557348.1, XM_001122629.3 and XM_006557349.1, but in the gff annotations the only transcript ID is XM_001122629.2...

    Is there any way to build a current set of annotations from the data NCBI uses to populate transcripts for gene records?


    Thanks

  • #2
    You can do this in a couple of different ways. One would be to get the invertebrate RefSeq data files (ftp://ftp.ncbi.nlm.nih.gov/refseq/release/invertebrate/) and parse out Apis mellifera entries.

    A simpler way would be to do a search: http://www.ncbi.nlm.nih.gov/nuccore/...D+srcdb_refseq. Change the "Display settings" to indicate the format you want (GenBank, Fasta etc) and then "Send to" a File. I currently see ~40,500 entries for the search above.

    Keep in mind that the above list will likely include all of these RefSeq types (http://www.ncbi.nlm.nih.gov/books/NB...ort=objectonly).
    Last edited by GenoMax; 05-09-2014, 03:21 AM.

    Comment


    • #3
      I downloaded ref_Amel_4.5_scaffolds.gff3

      And I see XM_006557348.1, XM_001122629.3 and XM_006557349.1 inside...

      Comment


      • #4
        Thanks for your replies..

        Definitely operator error...

        I need to start appending version dates as I've been looking at the wrong flatfile ....

        Comment

        Latest Articles

        Collapse

        • seqadmin
          Recent Advances in Sequencing Analysis Tools
          by seqadmin


          The sequencing world is rapidly changing due to declining costs, enhanced accuracies, and the advent of newer, cutting-edge instruments. Equally important to these developments are improvements in sequencing analysis, a process that converts vast amounts of raw data into a comprehensible and meaningful form. This complex task requires expertise and the right analysis tools. In this article, we highlight the progress and innovation in sequencing analysis by reviewing several of the...
          05-06-2024, 07:48 AM
        • seqadmin
          Essential Discoveries and Tools in Epitranscriptomics
          by seqadmin




          The field of epigenetics has traditionally concentrated more on DNA and how changes like methylation and phosphorylation of histones impact gene expression and regulation. However, our increased understanding of RNA modifications and their importance in cellular processes has led to a rise in epitranscriptomics research. “Epitranscriptomics brings together the concepts of epigenetics and gene expression,” explained Adrien Leger, PhD, Principal Research Scientist...
          04-22-2024, 07:01 AM

        ad_right_rmr

        Collapse

        News

        Collapse

        Topics Statistics Last Post
        Started by seqadmin, Yesterday, 06:57 AM
        0 responses
        12 views
        0 likes
        Last Post seqadmin  
        Started by seqadmin, 05-06-2024, 07:17 AM
        0 responses
        16 views
        0 likes
        Last Post seqadmin  
        Started by seqadmin, 05-02-2024, 08:06 AM
        0 responses
        19 views
        0 likes
        Last Post seqadmin  
        Started by seqadmin, 04-30-2024, 12:17 PM
        0 responses
        24 views
        0 likes
        Last Post seqadmin  
        Working...
        X