Go Back   SEQanswers > Applications Forums > RNA Sequencing

Similar Threads
Thread Thread Starter Forum Replies Last Post
Reference annotation file (gtf) for the NOD/ShiltJ mouse? michelletran RNA Sequencing 3 09-15-2015 06:16 AM
Annotation gtf file for GATK? michael.sharpnack RNA Sequencing 1 08-21-2014 02:36 PM
Creating new FA by extracting particular annotation (GTF) and sequence (reference FA) syintel87 Bioinformatics 0 02-06-2013 12:47 PM
entamoeba histolytica reference genome (gene annotation file) in GTF format paula123 Bioinformatics 1 06-04-2012 02:39 AM

Thread Tools
Old 05-19-2016, 12:32 PM   #1
Junior Member
Location: USA

Join Date: Sep 2012
Posts: 8
Default Reference annotation file (gtf) for Nile Tilapia (Oreochromis_niloticus)

I want to do differential gene expression analysis on some Nile Tilapia RNA-Seq data using the Cufflinks-Cuffdiff method. I aligned the sequence reads to the UCSC Nile Tilapia genome with >80% mapping efficiency. I downloaded the annotation GTF file from Ensembl and converted it using the "Make ensembl GTP compatible with Cufflinks" work flow. This workflow adds "chr" in front of the chromosome number. Then I ran cufflinks on the paired-end mapped data with either the original and converted GTF file. However, the counts were 0 for all gene and transcript expression. I noticed the the chromosome names of the Tilapia gtf file are strange. They are "GL831133.1" instead of 1,2,3. I also got zero counts using HTseq-count with either the original and converted GTF file. Does anyone know a good reference annotation file that work for Tilapia? Thanks!
Attached Images
File Type: jpg Nile Tilapia gtf2.jpg (96.5 KB, 4 views)
File Type: jpg Nile Tilapia cufflinks gene expression2.jpg (94.1 KB, 4 views) is offline   Reply With Quote
Old 05-19-2016, 02:05 PM   #2
Devon Ryan
Location: Freiburg, Germany

Join Date: Jul 2011
Posts: 3,480

Ensembl hasn't bothered to include oreNil2, which is what you got from UCSC. If UCSC doesn't provide an annotation then perhaps none exists.
dpryan is offline   Reply With Quote
Old 05-19-2016, 03:42 PM   #3
Senior Member
Location: East Coast USA

Join Date: Feb 2008
Posts: 7,076

Looks like Nile tilapia is available in UCSC Table Browser. You should be able to get a GTF file (which should match your sequence) from there.

Clade (Vertebrate) --> Genome (Nile Tilapia) --> group (Genes and Gene Predictions) --> Track (Choose one you want, RefSeq is available) --> Region (Genome) --> Output format (GTF) --> Provide a file name (e.g. tilapia_annot.gtf) --> Save/Use with HTseq-count of featureCounts.
GenoMax is offline   Reply With Quote
Old 05-24-2016, 01:25 PM   #4
Junior Member
Location: USA

Join Date: Sep 2012
Posts: 8

Yes, it turns out that the UCSC genome is OreNil2.0 and the ensembl annotation is OreNil1.0. I downloaded the ensembl tilapia genome (OreNil1.0) and did the tophat alignment again. This time, I could run cufflinks and HTseq with the ensembl annotation. is offline   Reply With Quote

Thread Tools

Posting Rules
You may not post new threads
You may not post replies
You may not post attachments
You may not edit your posts

BB code is On
Smilies are On
[IMG] code is On
HTML code is Off

All times are GMT -8. The time now is 11:57 PM.

Powered by vBulletin® Version 3.8.9
Copyright ©2000 - 2020, vBulletin Solutions, Inc.
Single Sign On provided by vBSSO