**shurjo** · 02-08-2010, 01:55 PM

This depends on how you want to treat your data. Giving TopHat the annotation file will force it to look for the junctions contained therein, even if it would not have considered them otherwise. There is a gtf2gff3 script available online (google the term) that you can use to make a GFF3 file for hg18 from the hg18 knownGenes table (which is downloadable in GTF format).

HTH,

Shurjo

**Wei-HD** · 02-08-2010, 02:05 PM

Hi shurjo,

Thanks for your reply. I already have the GFF3 file for mouse, Mus_musculus.NCBIM37.56.gff3, but I still have no clue when I should use this GFF file: before or after running TopHat? Sorry, I am a bit confused.

Many thanks!

**svl** · 02-08-2010, 02:13 PM

I am not sure what exactly you want, but if you:

1) want to use a GFF file to find out about gene expression, then note that TopHat, since version 1.0.12, says: "TopHat no longer calculates gene expression. Users interested in expression calculations should consider using Cufflinks for gene- and isoform-level expression calculations."

or

2) want to provide your own junctions, then search the manual for "Supplying your own junctions" and you'll see the "-G/--GFF <GFF3 file>" flag explained

svl

**shurjo** · 02-08-2010, 02:15 PM

Neither before nor after, but during the TopHat run :-). Use it with the -G option to TopHat.

Like so:

tophat --mate-inner-dist 240 --mate-std-dev 25 ~/bin/bowtie/bowtie-0.12.1/indexes/hg18_inclusive 108971.read1.fa 108971.read2.fa -m 2 -p 4 -G /home/sensh/pipeline_test/GFF3/UCSC_knowngenes_hg18_tweaked.gff3

**Wei-HD** · 02-08-2010, 02:26 PM

Thanks Shurjo and svl!

I just want to provide my own junctions. So I should write the following (I put the data file, bic.txt, the index files, and the GFF3 file in the same folder):

tophat --solexa1.3-quals Mus_musculus.NCBIM37.56 bic.txt -G mus_musculus.NCBIM37.56.gff3

But I got an error: Error: you must set the mean inner distance between mates with -r
And my data is not paired-end data.

Thanks in advance!

**svl** · 02-08-2010, 02:38 PM

Originally posted by Wei-HD:

tophat --solexa1.3-quals Mus_musculus.NCBIM37.56 bic.txt -G mus_musculus.NCBIM37.56.gff3

Maybe you have to put all options before the index-base and reads. The manual says:

Usage: tophat [options]* <index_base> <reads1_1[,...,readsN_1]> [reads1_2,...readsN_2]
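
To illustrate the usage line above, here is a minimal sketch of Wei-HD's command with every option moved in front of the index base and the reads file. The file names are copied from the earlier post, and whether this reordering actually removes the -r error is an assumption that has not been confirmed in this thread. The snippet only assembles and prints the command, so it is safe to run without TopHat installed.

```shell
# File names copied from the post above; adjust them to your own data.
index_base="Mus_musculus.NCBIM37.56"   # Bowtie index base name
reads="bic.txt"                        # single-end reads file
gff="mus_musculus.NCBIM37.56.gff3"     # GFF3 annotation supplying the junctions

# All options first, then <index_base>, then the reads file,
# following the order in the usage line quoted from the manual.
cmd="tophat --solexa1.3-quals -G $gff $index_base $reads"

# Print the command for inspection; run it yourself once it looks right.
echo "$cmd"
```

With single-end data, only one reads argument follows the index base, so no mate-distance options should be needed.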

GFF3 annotation file
