  • dawn1313
    Member
    • Aug 2015
    • 21

    Virus reference annotation

    Hi,

    I am trying to assemble a virus transcriptome using TopHat/Cufflinks. The reference genome is http://www.ncbi.nlm.nih.gov/nuccore/EF999921, and I wonder how I can get the annotation file (gtf/gff).

    Thank you!
    Dawn
  • GenoMax
    Senior Member
    • Feb 2008
    • 7142

    #2
    You could try this script to see whether it will convert a GenBank-format file to GFF: http://www.hpa-bioinformatics.org.uk...s/snippets/115

    • Chinboy
      Junior Member
      • Aug 2015
      • 2

      #3
      > library(cummeRbund)
      Loading required package: BiocGenerics
      Loading required package: parallel

      Attaching package: ‘BiocGenerics’

      The following objects are masked from ‘package:parallel’:

      clusterApply, clusterApplyLB, clusterCall, clusterEvalQ,
      clusterExport, clusterMap, parApply, parCapply, parLapply,
      parLapplyLB, parRapply, parSapply, parSapplyLB

      The following object is masked from ‘package:stats’:

      xtabs

      The following objects are masked from ‘package:base’:

      anyDuplicated, append, as.data.frame, as.vector, cbind, colnames,
      do.call, duplicated, eval, evalq, Filter, Find, get, intersect,
      is.unsorted, lapply, Map, mapply, match, mget, order, paste, pmax,
      pmax.int, pmin, pmin.int, Position, rank, rbind, Reduce, rep.int,
      rownames, sapply, setdiff, sort, table, tapply, union, unique,
      unlist, unsplit

      Loading required package: RSQLite
      Loading required package: DBI
      Loading required package: ggplot2
      Loading required package: reshape2
      Loading required package: fastcluster

      Attaching package: ‘fastcluster’

      The following object is masked from ‘package:stats’:

      hclust

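      Loading required package: rtracklayer
      Loading required package: GenomicRanges
      Loading required package: S4Vectors
      Loading required package: stats4
      Creating a generic function for ‘nchar’ from package ‘base’ in package ‘S4Vectors’
      Loading required package: IRanges
      Loading required package: GenomeInfoDb
      Loading required package: Gviz
      Loading required package: grid

      Attaching package: ‘cummeRbund’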

      The following object is masked from ‘package:GenomicRanges’:

      promoters

      The following object is masked from ‘package:IRanges’:

      promoters

      The following object is masked from ‘package:BiocGenerics’:

      conditions

      • dawn1313
        Member
        • Aug 2015
        • 21

        #4
        Hi, thanks for the response. My question is not about file format conversion, but about how to obtain the annotation file for the virus.

        Thanks!
        Dawn

        • GenoMax
          Senior Member
          • Feb 2008
          • 7142

          #5
          @Dawn: I don't think your particular genome of interest is available in the list of viral genomes at NCBI (ftp://ftp.ncbi.nlm.nih.gov/genomes/Viruses/), so there is no ready-made GFF file. I was therefore suggesting that you download the GenBank-format file for your accession from the link in your post and convert it to GFF yourself.
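
For readers who want to see what such a conversion involves, here is a minimal sketch in Python. It is not the script linked in #2. The function name `genbank_features_to_gff3` and the `SAMPLE` record are hypothetical and made up for illustration, not taken from EF999921. The sketch assumes simple single-line locations, collapses `join(...)` to one start..end span, ignores qualifiers, and assumes `codon_start=1` for CDS features. A maintained GenBank parser is likely a better choice for real work.

```python
import re

# Hypothetical toy record, NOT the real EF999921 annotation.
SAMPLE = """\
LOCUS       DEMO        1000 bp    DNA     linear   VRL
FEATURES             Location/Qualifiers
     source          1..1000
                     /organism="hypothetical virus"
     gene            100..400
                     /gene="demoA"
     CDS             complement(500..900)
                     /product="hypothetical protein"
ORIGIN
//
"""


def genbank_features_to_gff3(genbank_text, seqid):
    """Turn the FEATURES table of a GenBank flat file into GFF3 lines."""
    out = ["##gff-version 3"]
    in_features = False
    count = 0
    for line in genbank_text.splitlines():
        if line.startswith("FEATURES"):
            in_features = True
            continue
        if not in_features:
            continue
        if line and not line.startswith(" "):
            break  # reached the next top-level section (e.g. ORIGIN)
        # Feature lines have the key at column 6; qualifier lines are
        # indented further and are skipped in this sketch.
        if len(line) < 6 or not line.startswith("     ") or line[5] == " ":
            continue
        parts = line.split()
        if len(parts) < 2:
            continue
        key, location = parts[0], parts[1]
        coords = [int(n) for n in re.findall(r"\d+", location)]
        if not coords:
            continue
        count += 1
        strand = "-" if "complement(" in location else "+"
        phase = "0" if key == "CDS" else "."  # assumption: codon_start=1
        out.append("\t".join([
            seqid, "GenBank", key,
            str(min(coords)), str(max(coords)),  # join() collapsed to one span
            ".", strand, phase, f"ID={key}_{count}",
        ]))
    return out


if __name__ == "__main__":
    print("\n".join(genbank_features_to_gff3(SAMPLE, "DEMO")))
```

Each output row has the nine tab-separated GFF columns: sequence ID, source, feature type, start, end, score, strand, phase, and attributes.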

          • piet
            Member
            • Aug 2014
            • 21

            #6
            Originally posted by dawn1313
            The reference genome is http://www.ncbi.nlm.nih.gov/nuccore/EF999921, and I wonder how I can get the annotation file (gtf/gff).
            Code:
            wget http://togows.dbcls.jp/entry/nucleotide/EF999921.1.gff
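
The same download can be scripted. Below is a rough Python sketch that builds the URL used in the `wget` command above and checks that each feature line of the result has the nine tab-separated columns GFF requires before it is handed to TopHat/Cufflinks. The helper names (`togows_gff_url`, `fetch_gff`, `feature_rows`) are hypothetical, and the URL pattern is generalized from the single example above, so treat it as an assumption. The sketch does not check whether the service is still reachable.

```python
from urllib.request import urlopen

# Assumption: pattern generalized from the one URL given in the post above.
TOGOWS_ENTRY = "http://togows.dbcls.jp/entry/nucleotide/{accession}.gff"


def togows_gff_url(accession):
    """Build the download URL for a versioned accession such as EF999921.1."""
    return TOGOWS_ENTRY.format(accession=accession)


def fetch_gff(accession, timeout=30):
    """Download the GFF text; raises an exception on network or HTTP errors."""
    with urlopen(togows_gff_url(accession), timeout=timeout) as response:
        return response.read().decode("utf-8")


def feature_rows(gff_text):
    """Return feature lines split into their nine columns.

    Comment lines and blank lines are skipped; parsing stops at an
    embedded ##FASTA section, if there is one.
    """
    rows = []
    for line in gff_text.splitlines():
        if line.startswith("##FASTA"):
            break
        if not line.strip() or line.startswith("#"):
            continue
        fields = line.split("\t")
        if len(fields) != 9:
            raise ValueError(f"expected 9 columns, got {len(fields)}: {line!r}")
        rows.append(fields)
    return rows


if __name__ == "__main__":
    rows = feature_rows(fetch_gff("EF999921.1"))
    print(f"{len(rows)} feature lines")
```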

            • dawn1313
              Member
              • Aug 2015
              • 21

              #7
              Thank you so much, Piet. That's exactly what I was looking for.

              Best,
              Dawn
