I want to do differential gene expression analysis on some Nile Tilapia RNA-Seq data using the Cufflinks-Cuffdiff method. I aligned the sequence reads to the UCSC Nile Tilapia genome with >80% mapping efficiency. I downloaded the annotation GTF file from Ensembl and converted it using the "Make ensembl GTP compatible with Cufflinks" work flow. This workflow adds "chr" in front of the chromosome number. Then I ran cufflinks on the paired-end mapped data with either the original and converted GTF file. However, the counts were 0 for all gene and transcript expression. I noticed the the chromosome names of the Tilapia gtf file are strange. They are "GL831133.1" instead of 1,2,3. I also got zero counts using HTseq-count with either the original and converted GTF file. Does anyone know a good reference annotation file that work for Tilapia? Thanks!
Seqanswers Leaderboard Ad
Collapse
Announcement
Collapse
No announcement yet.
X
-
Looks like Nile tilapia is available in UCSC Table Browser. You should be able to get a GTF file (which should match your sequence) from there.
Clade (Vertebrate) --> Genome (Nile Tilapia) --> group (Genes and Gene Predictions) --> Track (Choose one you want, RefSeq is available) --> Region (Genome) --> Output format (GTF) --> Provide a file name (e.g. tilapia_annot.gtf) --> Save/Use with HTseq-count of featureCounts.
Comment
Latest Articles
Collapse
-
by seqadmin
The field of epigenetics has traditionally concentrated more on DNA and how changes like methylation and phosphorylation of histones impact gene expression and regulation. However, our increased understanding of RNA modifications and their importance in cellular processes has led to a rise in epitranscriptomics research. “Epitranscriptomics brings together the concepts of epigenetics and gene expression,” explained Adrien Leger, PhD, Principal Research Scientist on Modified Bases...-
Channel: Articles
Yesterday, 07:01 AM -
-
by seqadmin
Proteins are often described as the workhorses of the cell, and identifying their sequences is key to understanding their role in biological processes and disease. Currently, the most common technique used to determine protein sequences is mass spectrometry. While still a valuable tool, mass spectrometry faces several limitations and requires a highly experienced scientist familiar with the equipment to operate it. Additionally, other proteomic methods, like affinity assays, are constrained...-
Channel: Articles
04-04-2024, 04:25 PM -
ad_right_rmr
Collapse
News
Collapse
Topics | Statistics | Last Post | ||
---|---|---|---|---|
Started by seqadmin, 04-11-2024, 12:08 PM
|
0 responses
39 views
0 likes
|
Last Post
by seqadmin
04-11-2024, 12:08 PM
|
||
Started by seqadmin, 04-10-2024, 10:19 PM
|
0 responses
41 views
0 likes
|
Last Post
by seqadmin
04-10-2024, 10:19 PM
|
||
Started by seqadmin, 04-10-2024, 09:21 AM
|
0 responses
35 views
0 likes
|
Last Post
by seqadmin
04-10-2024, 09:21 AM
|
||
Started by seqadmin, 04-04-2024, 09:00 AM
|
0 responses
55 views
0 likes
|
Last Post
by seqadmin
04-04-2024, 09:00 AM
|
Comment