Seqanswers Leaderboard Ad

Collapse

Announcement

Collapse
No announcement yet.
X
 
  • Filter
  • Time
  • Show
Clear All
new posts

  • Tophat v1.1 with GTF files

    I wanted to try tophat 1.1 with a UCSC supplied GTF file, but the binary (linux-x86_64) I downloaded keeps asking for a GFF file. Has anyone had success running this version with a GTF file?

  • #2
    I have had succes with the 1.1.0 binary. Are you sure you are running the updated TopHat and not an old one left on your system? You can check using the --version option when you run TopHat ($ tophat --version).

    Comment


    • #3
      It's working! I had some deprecated paths in my scripts which were causing the problem. Thanks for your help!

      Comment


      • #4
        where can I get reliable GTF annotation files?

        I'm trying to get the GTF files for hg18, preferably at isoform level, from UCSC browser portal, in order to run with Tophat and cufflinks, but apparently I can't find such files there.
        So far I've managed to download a table from here http://genome.ucsc.edu/cgi-bin/hgTables?command=start but I'm not quite sure if it's the right way to do it. How did you guys get the file?

        thanks

        Comment


        • #5
          You are correct in using the table browser. To download a GTF file of a track, you need to select GTF in the output format dropdown menu, type a name for the output file, for instance hg18.UCSCknowngene.isoforms.gtf, then click get output. That should get you a GTF file.

          Comment


          • #6
            yes that what I did, but the problem is that there are loads of options for group/track/table and I don't find any single combination to look significantly more appealing. I'm looking for a detailed annotation for hg18, currently I have taken, the GTF file with:

            Group: Genes and Gene prediction Tracks
            Track: RefSeq Genes
            Table: refGene

            but I dunno how sensible/standard choice it is! and I guess it does not contain isoform level annotation.

            Comment


            • #7
              RefSeq gene does include a lot of isoforms (any that have RefSeq mRNA entries), but there are certainly isoforms expected to be missing.

              Comment


              • #8
                So you can supply TopHat with a GTF file of annotated transcripts, which, using the --GTF option, will be the first place where reads are mapped, followed by the whole genome, with or without novel junction discovery in this second stage. As I understand it, this is after TopHat 1.4.
                I'm curious to know how t was before 1.4. I think you could already give TopHat a GTF file, but it used it second. Am I right? If so, what is the difference between using it [the GTF file] first and using it second after the genome?

                Comment

                Latest Articles

                Collapse

                • seqadmin
                  Exploring the Dynamics of the Tumor Microenvironment
                  by seqadmin




                  The complexity of cancer is clearly demonstrated in the diverse ecosystem of the tumor microenvironment (TME). The TME is made up of numerous cell types and its development begins with the changes that happen during oncogenesis. “Genomic mutations, copy number changes, epigenetic alterations, and alternative gene expression occur to varying degrees within the affected tumor cells,” explained Andrea O’Hara, Ph.D., Strategic Technical Specialist at Azenta. “As...
                  07-08-2024, 03:19 PM
                • seqadmin
                  Exploring Human Diversity Through Large-Scale Omics
                  by seqadmin


                  In 2003, researchers from the Human Genome Project (HGP) announced the most comprehensive genome to date1. Although the genome wasn’t fully completed until nearly 20 years later2, numerous large-scale projects, such as the International HapMap Project and 1000 Genomes Project, continued the HGP's work, capturing extensive variation and genomic diversity within humans. Recently, newer initiatives have significantly increased in scale and expanded beyond genomics, offering a more detailed...
                  06-25-2024, 06:43 AM

                ad_right_rmr

                Collapse

                News

                Collapse

                Topics Statistics Last Post
                Started by seqadmin, 07-19-2024, 07:20 AM
                0 responses
                39 views
                0 likes
                Last Post seqadmin  
                Started by seqadmin, 07-16-2024, 05:49 AM
                0 responses
                51 views
                0 likes
                Last Post seqadmin  
                Started by seqadmin, 07-15-2024, 06:53 AM
                0 responses
                62 views
                0 likes
                Last Post seqadmin  
                Started by seqadmin, 07-10-2024, 07:30 AM
                0 responses
                43 views
                0 likes
                Last Post seqadmin  
                Working...
                X