Seqanswers Leaderboard Ad

Collapse

Announcement

Collapse
No announcement yet.
X
 
  • Filter
  • Time
  • Show
Clear All
new posts

  • GFF file for TopHat

    I am analyzing mRNA-Seq generated by solexa from human samples; I would like to run TopHat with a GFF file in order to obtain gene expression values (-G option).
    Can any body point out where can I get a GFF file for human genome?
    Thanks

  • #2
    I've been looking for the same file but for mouse.

    I did come across such files being available for many organisms (including human but not mouse). Try this: http://www.sequenceontology.org/reso...databases.html
    and please let me know if it works.

    If anybody knows where to find the same file for mouse, please let me know

    Comment


    • #3
      Hi there,

      How I did it, although there is definitely a better way. I downloaded the UCSC knowngene table for human in BED format and converted it to GFF2 (through GALAXY). I then converted that to GFF3 using the package on this site
      http://bcbio.wordpress.com/2009/03/2...sing-and-gff2/

      As I said there is definitely a better way to do this, I only needed to convert a single file so I never looked any further. However if anyone knows of a straight BED to GFF3 converter (or a package that can convert between other annotation file formats) please post it !!!

      Warren

      Comment

      Latest Articles

      Collapse

      • seqadmin
        Essential Discoveries and Tools in Epitranscriptomics
        by seqadmin


        The field of epigenetics has traditionally concentrated more on DNA and how changes like methylation and phosphorylation of histones impact gene expression and regulation. However, our increased understanding of RNA modifications and their importance in cellular processes has led to a rise in epitranscriptomics research. “Epitranscriptomics brings together the concepts of epigenetics and gene expression,” explained Adrien Leger, PhD, Principal Research Scientist on Modified Bases...
        Yesterday, 07:01 AM
      • seqadmin
        Current Approaches to Protein Sequencing
        by seqadmin


        Proteins are often described as the workhorses of the cell, and identifying their sequences is key to understanding their role in biological processes and disease. Currently, the most common technique used to determine protein sequences is mass spectrometry. While still a valuable tool, mass spectrometry faces several limitations and requires a highly experienced scientist familiar with the equipment to operate it. Additionally, other proteomic methods, like affinity assays, are constrained...
        04-04-2024, 04:25 PM

      ad_right_rmr

      Collapse

      News

      Collapse

      Topics Statistics Last Post
      Started by seqadmin, 04-11-2024, 12:08 PM
      0 responses
      55 views
      0 likes
      Last Post seqadmin  
      Started by seqadmin, 04-10-2024, 10:19 PM
      0 responses
      51 views
      0 likes
      Last Post seqadmin  
      Started by seqadmin, 04-10-2024, 09:21 AM
      0 responses
      45 views
      0 likes
      Last Post seqadmin  
      Started by seqadmin, 04-04-2024, 09:00 AM
      0 responses
      55 views
      0 likes
      Last Post seqadmin  
      Working...
      X