SEQanswers

Go Back   SEQanswers > Bioinformatics > Bioinformatics



Similar Threads
Thread Thread Starter Forum Replies Last Post
gffread converts stop codons into CDS jg3197 Bioinformatics 0 06-05-2014 04:47 AM
start and stop position of mapping in Blast Navy1991 Bioinformatics 3 06-02-2014 05:25 AM
from CDS/genome sequences to gff file capricy Bioinformatics 6 09-20-2013 11:46 AM
how to fetch the sequences on the basis of start and stop codon M.Verma Bioinformatics 1 05-12-2013 10:44 PM
genename/start/stop position file bioinfo_ Bioinformatics 0 04-13-2012 12:57 PM

Reply
 
Thread Tools
Old 06-06-2014, 07:12 AM   #1
3sTan
Junior Member
 
Location: Germany

Join Date: May 2014
Posts: 2
Default Human Ensembl GFF file, identical start and stop in CDS

Hi all,

I am discovering the pleasure to work with GFF files and I have a question related to the human GFF file present in Ensembl.
More particulary if I look at this transcript:

http://www.ensembl.org/Homo_sapiens/...NST00000575073

FOPNL-007

Region: chromosome:GRCh37:16:15961195:15982482:1 Transcript: ENST00000575073 (FOPNL-007)
16 Ensembl_havana Exon 15961195 15961373 . - 2 gene_id=ENSG00000133393; gene_name=FOPNL; transcript_id=ENST00000575073; transcript_name=FOPNL-007; exon_id=ENSE00002640477; gene_type=KNOWN_protein_coding
16 Ensembl_havana Exon 15973661 15973745 . - 1 gene_id=ENSG00000133393; gene_name=FOPNL; transcript_id=ENST00000575073; transcript_name=FOPNL-007; exon_id=ENSE00003662092; gene_type=KNOWN_protein_coding
16 Ensembl_havana Exon 15977865 15978062 . - 2 gene_id=ENSG00000133393; gene_name=FOPNL; transcript_id=ENST00000575073; transcript_name=FOPNL-007; exon_id=ENSE00000909153; gene_type=KNOWN_protein_coding
16 Ensembl_havana Exon 15982415 15982482 . - . gene_id=ENSG00000133393; gene_name=FOPNL; transcript_id=ENST00000575073; transcript_name=FOPNL-007; exon_id=ENSE00002635299; gene_type=KNOWN_protein_coding

and the same transcript in the Ensembl GFF file:

ftp://ftp.ensembl.org/pub/release-75...Ch37.75.gtf.gz

16 protein_coding transcript 15961195 15982482 . - . gene_id "ENSG00000133393"; transcript_id "ENST00000575073"; gene_name "FOPNL"; gene_source "ensembl_havana"; gene_biotype "protein_coding"; transcript_name "FOPNL-007"; transcript_source "havana";
16 protein_coding exon 15982415 15982482 . - . gene_id "ENSG00000133393"; transcript_id "ENST00000575073"; exon_number "1"; gene_name "FOPNL"; gene_source "ensembl_havana"; gene_biotype "protein_coding"; transcript_name "FOPNL-007"; transcript_source "havana"; exon_id "ENSE00002635299";
16 protein_coding CDS 15982415 15982442 . - 0 gene_id "ENSG00000133393"; transcript_id "ENST00000575073"; exon_number "1"; gene_name "FOPNL"; gene_source "ensembl_havana"; gene_biotype "protein_coding"; transcript_name "FOPNL-007"; transcript_source "havana"; protein_id "ENSP00000459804";
16 protein_coding start_codon 15982440 15982442 . - 0 gene_id "ENSG00000133393"; transcript_id "ENST00000575073"; exon_number "1"; gene_name "FOPNL"; gene_source "ensembl_havana"; gene_biotype "protein_coding"; transcript_name "FOPNL-007"; transcript_source "havana";
16 protein_coding exon 15977865 15978062 . - . gene_id "ENSG00000133393"; transcript_id "ENST00000575073"; exon_number "2"; gene_name "FOPNL"; gene_source "ensembl_havana"; gene_biotype "protein_coding"; transcript_name "FOPNL-007"; transcript_source "havana"; exon_id "ENSE00000909153";
16 protein_coding CDS 15977865 15978062 . - 2 gene_id "ENSG00000133393"; transcript_id "ENST00000575073"; exon_number "2"; gene_name "FOPNL"; gene_source "ensembl_havana"; gene_biotype "protein_coding"; transcript_name "FOPNL-007"; transcript_source "havana"; protein_id "ENSP00000459804";
16 protein_coding exon 15973661 15973745 . - . gene_id "ENSG00000133393"; transcript_id "ENST00000575073"; exon_number "3"; gene_name "FOPNL"; gene_source "ensembl_havana"; gene_biotype "protein_coding"; transcript_name "FOPNL-007"; transcript_source "havana"; exon_id "ENSE00003662092";
16 protein_coding CDS 15973661 15973745 . - 2 gene_id "ENSG00000133393"; transcript_id "ENST00000575073"; exon_number "3"; gene_name "FOPNL"; gene_source "ensembl_havana"; gene_biotype "protein_coding"; transcript_name "FOPNL-007"; transcript_source "havana"; protein_id "ENSP00000459804";
16 protein_coding exon 15961195 15961373 . - . gene_id "ENSG00000133393"; transcript_id "ENST00000575073"; exon_number "4"; gene_name "FOPNL"; gene_source "ensembl_havana"; gene_biotype "protein_coding"; transcript_name "FOPNL-007"; transcript_source "havana"; exon_id "ENSE00002640477";
16 protein_coding CDS 15961373 15961373 . - 1 gene_id "ENSG00000133393"; transcript_id "ENST00000575073"; exon_number "4"; gene_name "FOPNL"; gene_source "ensembl_havana"; gene_biotype "protein_coding"; transcript_name "FOPNL-007"; transcript_source "havana"; protein_id "ENSP00000459804";
16 protein_coding stop_codon 15961370 15961372 . - 0 gene_id "ENSG00000133393"; transcript_id "ENST00000575073"; exon_number "4"; gene_name "FOPNL"; gene_source "ensembl_havana"; gene_biotype "protein_coding"; transcript_name "FOPNL-007"; transcript_source "havana";
16 protein_coding UTR 15982443 15982482 . - . gene_id "ENSG00000133393"; transcript_id "ENST00000575073"; gene_name "FOPNL"; gene_source "ensembl_havana"; gene_biotype "protein_coding"; transcript_name "FOPNL-007"; transcript_source "havana";
16 protein_coding UTR 15961195 15961369 . - . gene_id "ENSG00000133393"; transcript_id "ENST00000575073"; gene_name "FOPNL"; gene_source "ensembl_havana"; gene_biotype "protein_coding"; transcript_name "FOPNL-007"; transcript_source "havana";

The exons are fine but is it normal that the last CDS have a length of 0?
Thanks!
3sTan is offline   Reply With Quote
Reply

Thread Tools

Posting Rules
You may not post new threads
You may not post replies
You may not post attachments
You may not edit your posts

BB code is On
Smilies are On
[IMG] code is On
HTML code is Off




All times are GMT -8. The time now is 05:13 AM.


Powered by vBulletin® Version 3.8.9
Copyright ©2000 - 2018, vBulletin Solutions, Inc.
Single Sign On provided by vBSSO