Seqanswers Leaderboard Ad

Collapse

Announcement

Collapse
No announcement yet.
X
 
  • Filter
  • Time
  • Show
Clear All
new posts

  • Tophat v1.1 with GTF files

    I wanted to try tophat 1.1 with a UCSC supplied GTF file, but the binary (linux-x86_64) I downloaded keeps asking for a GFF file. Has anyone had success running this version with a GTF file?

  • #2
    I have had succes with the 1.1.0 binary. Are you sure you are running the updated TopHat and not an old one left on your system? You can check using the --version option when you run TopHat ($ tophat --version).

    Comment


    • #3
      It's working! I had some deprecated paths in my scripts which were causing the problem. Thanks for your help!

      Comment


      • #4
        where can I get reliable GTF annotation files?

        I'm trying to get the GTF files for hg18, preferably at isoform level, from UCSC browser portal, in order to run with Tophat and cufflinks, but apparently I can't find such files there.
        So far I've managed to download a table from here http://genome.ucsc.edu/cgi-bin/hgTables?command=start but I'm not quite sure if it's the right way to do it. How did you guys get the file?

        thanks

        Comment


        • #5
          You are correct in using the table browser. To download a GTF file of a track, you need to select GTF in the output format dropdown menu, type a name for the output file, for instance hg18.UCSCknowngene.isoforms.gtf, then click get output. That should get you a GTF file.

          Comment


          • #6
            yes that what I did, but the problem is that there are loads of options for group/track/table and I don't find any single combination to look significantly more appealing. I'm looking for a detailed annotation for hg18, currently I have taken, the GTF file with:

            Group: Genes and Gene prediction Tracks
            Track: RefSeq Genes
            Table: refGene

            but I dunno how sensible/standard choice it is! and I guess it does not contain isoform level annotation.

            Comment


            • #7
              RefSeq gene does include a lot of isoforms (any that have RefSeq mRNA entries), but there are certainly isoforms expected to be missing.

              Comment


              • #8
                So you can supply TopHat with a GTF file of annotated transcripts, which, using the --GTF option, will be the first place where reads are mapped, followed by the whole genome, with or without novel junction discovery in this second stage. As I understand it, this is after TopHat 1.4.
                I'm curious to know how t was before 1.4. I think you could already give TopHat a GTF file, but it used it second. Am I right? If so, what is the difference between using it [the GTF file] first and using it second after the genome?

                Comment

                Latest Articles

                Collapse

                • seqadmin
                  Best Practices for Single-Cell Sequencing Analysis
                  by seqadmin



                  While isolating and preparing single cells for sequencing was historically the bottleneck, recent technological advancements have shifted the challenge to data analysis. This highlights the rapidly evolving nature of single-cell sequencing. The inherent complexity of single-cell analysis has intensified with the surge in data volume and the incorporation of diverse and more complex datasets. This article explores the challenges in analysis, examines common pitfalls, offers...
                  06-06-2024, 07:15 AM
                • seqadmin
                  Latest Developments in Precision Medicine
                  by seqadmin



                  Technological advances have led to drastic improvements in the field of precision medicine, enabling more personalized approaches to treatment. This article explores four leading groups that are overcoming many of the challenges of genomic profiling and precision medicine through their innovative platforms and technologies.

                  Somatic Genomics
                  “We have such a tremendous amount of genetic diversity that exists within each of us, and not just between us as individuals,”...
                  05-24-2024, 01:16 PM

                ad_right_rmr

                Collapse

                News

                Collapse

                Topics Statistics Last Post
                Started by seqadmin, 06-07-2024, 06:58 AM
                0 responses
                179 views
                0 likes
                Last Post seqadmin  
                Started by seqadmin, 06-06-2024, 08:18 AM
                0 responses
                227 views
                0 likes
                Last Post seqadmin  
                Started by seqadmin, 06-06-2024, 08:04 AM
                0 responses
                184 views
                0 likes
                Last Post seqadmin  
                Started by seqadmin, 06-03-2024, 06:55 AM
                0 responses
                18 views
                0 likes
                Last Post seqadmin  
                Working...
                X