Seqanswers Leaderboard Ad

Collapse

Announcement

Collapse
No announcement yet.
X
 
  • Filter
  • Time
  • Show
Clear All
new posts

  • Human Ensembl GFF file, identical start and stop in CDS

    Hi all,

    I am discovering the pleasure to work with GFF files and I have a question related to the human GFF file present in Ensembl.
    More particulary if I look at this transcript:



    FOPNL-007

    Region: chromosome:GRCh37:16:15961195:15982482:1 Transcript: ENST00000575073 (FOPNL-007)
    16 Ensembl_havana Exon 15961195 15961373 . - 2 gene_id=ENSG00000133393; gene_name=FOPNL; transcript_id=ENST00000575073; transcript_name=FOPNL-007; exon_id=ENSE00002640477; gene_type=KNOWN_protein_coding
    16 Ensembl_havana Exon 15973661 15973745 . - 1 gene_id=ENSG00000133393; gene_name=FOPNL; transcript_id=ENST00000575073; transcript_name=FOPNL-007; exon_id=ENSE00003662092; gene_type=KNOWN_protein_coding
    16 Ensembl_havana Exon 15977865 15978062 . - 2 gene_id=ENSG00000133393; gene_name=FOPNL; transcript_id=ENST00000575073; transcript_name=FOPNL-007; exon_id=ENSE00000909153; gene_type=KNOWN_protein_coding
    16 Ensembl_havana Exon 15982415 15982482 . - . gene_id=ENSG00000133393; gene_name=FOPNL; transcript_id=ENST00000575073; transcript_name=FOPNL-007; exon_id=ENSE00002635299; gene_type=KNOWN_protein_coding

    and the same transcript in the Ensembl GFF file:

    ftp://ftp.ensembl.org/pub/release-75...Ch37.75.gtf.gz

    16 protein_coding transcript 15961195 15982482 . - . gene_id "ENSG00000133393"; transcript_id "ENST00000575073"; gene_name "FOPNL"; gene_source "ensembl_havana"; gene_biotype "protein_coding"; transcript_name "FOPNL-007"; transcript_source "havana";
    16 protein_coding exon 15982415 15982482 . - . gene_id "ENSG00000133393"; transcript_id "ENST00000575073"; exon_number "1"; gene_name "FOPNL"; gene_source "ensembl_havana"; gene_biotype "protein_coding"; transcript_name "FOPNL-007"; transcript_source "havana"; exon_id "ENSE00002635299";
    16 protein_coding CDS 15982415 15982442 . - 0 gene_id "ENSG00000133393"; transcript_id "ENST00000575073"; exon_number "1"; gene_name "FOPNL"; gene_source "ensembl_havana"; gene_biotype "protein_coding"; transcript_name "FOPNL-007"; transcript_source "havana"; protein_id "ENSP00000459804";
    16 protein_coding start_codon 15982440 15982442 . - 0 gene_id "ENSG00000133393"; transcript_id "ENST00000575073"; exon_number "1"; gene_name "FOPNL"; gene_source "ensembl_havana"; gene_biotype "protein_coding"; transcript_name "FOPNL-007"; transcript_source "havana";
    16 protein_coding exon 15977865 15978062 . - . gene_id "ENSG00000133393"; transcript_id "ENST00000575073"; exon_number "2"; gene_name "FOPNL"; gene_source "ensembl_havana"; gene_biotype "protein_coding"; transcript_name "FOPNL-007"; transcript_source "havana"; exon_id "ENSE00000909153";
    16 protein_coding CDS 15977865 15978062 . - 2 gene_id "ENSG00000133393"; transcript_id "ENST00000575073"; exon_number "2"; gene_name "FOPNL"; gene_source "ensembl_havana"; gene_biotype "protein_coding"; transcript_name "FOPNL-007"; transcript_source "havana"; protein_id "ENSP00000459804";
    16 protein_coding exon 15973661 15973745 . - . gene_id "ENSG00000133393"; transcript_id "ENST00000575073"; exon_number "3"; gene_name "FOPNL"; gene_source "ensembl_havana"; gene_biotype "protein_coding"; transcript_name "FOPNL-007"; transcript_source "havana"; exon_id "ENSE00003662092";
    16 protein_coding CDS 15973661 15973745 . - 2 gene_id "ENSG00000133393"; transcript_id "ENST00000575073"; exon_number "3"; gene_name "FOPNL"; gene_source "ensembl_havana"; gene_biotype "protein_coding"; transcript_name "FOPNL-007"; transcript_source "havana"; protein_id "ENSP00000459804";
    16 protein_coding exon 15961195 15961373 . - . gene_id "ENSG00000133393"; transcript_id "ENST00000575073"; exon_number "4"; gene_name "FOPNL"; gene_source "ensembl_havana"; gene_biotype "protein_coding"; transcript_name "FOPNL-007"; transcript_source "havana"; exon_id "ENSE00002640477";
    16 protein_coding CDS 15961373 15961373 . - 1 gene_id "ENSG00000133393"; transcript_id "ENST00000575073"; exon_number "4"; gene_name "FOPNL"; gene_source "ensembl_havana"; gene_biotype "protein_coding"; transcript_name "FOPNL-007"; transcript_source "havana"; protein_id "ENSP00000459804";
    16 protein_coding stop_codon 15961370 15961372 . - 0 gene_id "ENSG00000133393"; transcript_id "ENST00000575073"; exon_number "4"; gene_name "FOPNL"; gene_source "ensembl_havana"; gene_biotype "protein_coding"; transcript_name "FOPNL-007"; transcript_source "havana";
    16 protein_coding UTR 15982443 15982482 . - . gene_id "ENSG00000133393"; transcript_id "ENST00000575073"; gene_name "FOPNL"; gene_source "ensembl_havana"; gene_biotype "protein_coding"; transcript_name "FOPNL-007"; transcript_source "havana";
    16 protein_coding UTR 15961195 15961369 . - . gene_id "ENSG00000133393"; transcript_id "ENST00000575073"; gene_name "FOPNL"; gene_source "ensembl_havana"; gene_biotype "protein_coding"; transcript_name "FOPNL-007"; transcript_source "havana";

    The exons are fine but is it normal that the last CDS have a length of 0?
    Thanks!

Latest Articles

Collapse

  • seqadmin
    Recent Advances in Sequencing Analysis Tools
    by seqadmin


    The sequencing world is rapidly changing due to declining costs, enhanced accuracies, and the advent of newer, cutting-edge instruments. Equally important to these developments are improvements in sequencing analysis, a process that converts vast amounts of raw data into a comprehensible and meaningful form. This complex task requires expertise and the right analysis tools. In this article, we highlight the progress and innovation in sequencing analysis by reviewing several of the...
    05-06-2024, 07:48 AM
  • seqadmin
    Essential Discoveries and Tools in Epitranscriptomics
    by seqadmin




    The field of epigenetics has traditionally concentrated more on DNA and how changes like methylation and phosphorylation of histones impact gene expression and regulation. However, our increased understanding of RNA modifications and their importance in cellular processes has led to a rise in epitranscriptomics research. “Epitranscriptomics brings together the concepts of epigenetics and gene expression,” explained Adrien Leger, PhD, Principal Research Scientist...
    04-22-2024, 07:01 AM

ad_right_rmr

Collapse

News

Collapse

Topics Statistics Last Post
Started by seqadmin, Yesterday, 06:57 AM
0 responses
11 views
0 likes
Last Post seqadmin  
Started by seqadmin, 05-06-2024, 07:17 AM
0 responses
16 views
0 likes
Last Post seqadmin  
Started by seqadmin, 05-02-2024, 08:06 AM
0 responses
19 views
0 likes
Last Post seqadmin  
Started by seqadmin, 04-30-2024, 12:17 PM
0 responses
24 views
0 likes
Last Post seqadmin  
Working...
X