**natstreet** · 11-04-2010, 11:40 PM

I found a great script for converting gff3 to gtf, and another for converting cufflinks gtf to gff3. Both have saved me a lot of hassle when using data from, and getting data into, GBrowse. By default the gff3togtf script creates gene_id entries in the attributes column, but cufflinks will only work with gene_name. I've left the script in its original form here, so you should either change the script or post-process the gtf file it produces, e.g. with a sed command.
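As an illustration of the post-processing step mentioned above, here is a minimal Python sketch (an alternative to a sed one-liner, not the attached script). It assumes the GTF attributes look like `gene_id "X";` and it copies the gene_id value into a new gene_name attribute rather than renaming it, because the GTF format itself expects gene_id to be present. The function name `add_gene_name` is made up for this example.

```python
import re

# Matches the gene_id attribute in a GTF attributes column, e.g. gene_id "g1";
GENE_ID_RE = re.compile(r'gene_id "([^"]*)";')

def add_gene_name(line):
    """Return a GTF line with gene_name copied from gene_id if it is missing."""
    # Leave comments and blank lines untouched.
    if line.startswith("#") or not line.strip():
        return line
    fields = line.rstrip("\n").split("\t")
    # A GTF record has exactly 9 tab-separated columns; skip anything else.
    if len(fields) != 9:
        return line
    attrs = fields[8].rstrip()
    match = GENE_ID_RE.search(attrs)
    if match and "gene_name " not in attrs:
        if not attrs.endswith(";"):
            attrs += ";"
        attrs += ' gene_name "%s";' % match.group(1)
    fields[8] = attrs
    return "\t".join(fields) + "\n"
```

To convert a whole file, one could read it line by line and write `add_gene_name(line)` to a new file. If your downstream tool really needs gene_id replaced rather than duplicated, the same regex can be used with `re.sub` instead.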

I've attached them both. As soon as the server holding my notes is back up, I will edit this reply to link to the originals so that credit goes to the right people.

**Simon Anders** · 11-05-2010, 01:17 AM

Be sure to read the man page of htseq-count. It has options for specifying what the gene ID attribute is called in your GFF file (Ensembl's standard is "gene_id", but as 'natstreet' just said, you also see 'gene_name', 'ID' or whatever).
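Before choosing a value for htseq-count's ID attribute option (`-i` / `--idattr`), it can help to check which attribute names your annotation file actually contains. The following is a rough sketch, not part of HTSeq: the helper name `attribute_keys` is invented here, and it assumes a 9-column tab-separated file whose last column is either GTF style (`key "value";`) or GFF3 style (`key=value;`).

```python
from collections import Counter

def attribute_keys(lines, feature_type="exon"):
    """Count attribute names in column 9 for rows of the given feature type."""
    counts = Counter()
    for line in lines:
        if line.startswith("#"):
            continue
        fields = line.rstrip("\n").split("\t")
        # Only look at well-formed rows of the requested type (column 3).
        if len(fields) != 9 or fields[2] != feature_type:
            continue
        # Note: this simple split would mis-handle values containing ';'.
        for item in fields[8].split(";"):
            item = item.strip()
            if not item:
                continue
            if "=" in item.split(" ")[0]:
                key = item.split("=", 1)[0]   # GFF3 style: key=value
            else:
                key = item.split(" ", 1)[0]   # GTF style: key "value"
            counts[key] += 1
    return counts
```

Calling `attribute_keys(open("annotation.gtf"))` (the file name is a placeholder) shows which names occur on exon rows. An attribute that appears on every exon row and identifies the gene is the likely candidate to pass as the ID attribute.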

**dingkai0564** · 11-05-2010, 09:56 AM

About HTSeq

Originally posted by Simon Anders:

Be sure to read the man page of htseq-count. It has options for specifying what the gene ID attribute is called in your GFF file (Ensembl's standard is "gene_id", but as 'natstreet' just said, you also see 'gene_name', 'ID' or whatever).

Thanks for your advice. It seems that I can get HTSeq running; however, I only get the following output:

50972 GFF lines processed.
100000 reads processed.
200000 reads processed.
300000 reads processed.
400000 reads processed.
500000 reads processed.
600000 reads processed.
700000 reads processed.
727886 reads processed.
13101 229869
no_feature 498017
ambiguous 0
too low aQual 0
not aligned 4460065

but I cannot get the counts for each feature. Could you tell me what I should do to get the number of short reads for each gene or each exon?

Thanks!
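The pasted output contains a single feature row (13101) next to the summary counters, mixed with progress messages. As a rough sketch of how such output can be inspected, the Python snippet below separates feature rows from summary rows. It is not part of HTSeq; the function name `split_counts` and the list of summary names are assumptions based on the names shown in the post, and it splits on the last run of whitespace so that it copes with both tab-separated output and the space-separated paste above.

```python
# Summary counter names as they appear in the post (spaces normalized to "_").
SPECIAL = {"no_feature", "ambiguous", "too_low_aQual",
           "not_aligned", "alignment_not_unique"}

def split_counts(lines):
    """Split count-table lines into (feature_counts, summary_counts) dicts."""
    features, summary = {}, {}
    for line in lines:
        parts = line.strip().rsplit(None, 1)
        # Skip blank lines and progress messages such as
        # "100000 reads processed.", whose last token is not a number.
        if len(parts) != 2 or not parts[1].isdigit():
            continue
        name, count = parts[0], int(parts[1])
        key = name.lstrip("_").replace(" ", "_")
        if key in SPECIAL:
            summary[key] = count
        else:
            features[name] = count
    return features, summary
```

If `len(features)` comes out as 1 for a whole-genome annotation, as it would for the output above, that suggests every annotation line ended up under the same ID, so the feature type and ID attribute settings are the first things to check.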

**dingkai0564** · 11-05-2010, 01:24 PM

Thank you all! I solved the problem.

**carmeyeii** · 12-14-2012, 12:49 PM

So you can supply TopHat with a GTF file of annotated transcripts using the --GTF option. Reads are then mapped to those transcripts first and to the whole genome second, with or without novel junction discovery in that second stage. As I understand it, this is the behavior from TopHat 1.4 onward.
I'm curious to know how it was before 1.4. I think you could already give TopHat a GTF file, but it was used second. Am I right? If so, what is the difference between using the GTF file first and using it second, after the genome?

asking questions about gtf files