**apadr007** · 12-13-2011, 08:12 AM

I have the same question. Why is cufflinks repeating genes?

**kenphi** · 02-24-2012, 02:48 AM

Dear silin

I think this is because your reference annotation contains "unrelated" transcripts annotated to the same gene. I have noticed that this happens when there are independent transcript groups, i.e. groups of transcripts that do not overlap in exon coordinates. They can be side by side, or one can lie in the intron of the other. Some examples from Ensembl 64:

ENSMUSG00000086255
ENSMUSG00000062352
ENSMUSG00000021879
ENSMUSG00000033705
ENSMUSG00000087461
ENSMUSG00000022105
ENSMUSG00000073791
ENSMUSG00000052675
ENSMUSG00000055407
ENSMUSG00000056856
ENSMUSG00000027203

In some of these cases, I would say that Ensembl didn't follow its own guideline of assigning the same gene identifier to transcripts with overlapping positions, because these are clearly independent clusters.
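To check an annotation for such cases, something like the following might help. It is only a rough sketch: it assumes the exon records have already been pulled out of the GTF as `(gene_id, transcript_id, start, end)` tuples with inclusive coordinates, it ignores strand and chromosome, and the function name `transcript_clusters` and the identifiers in the usage lines are made up for illustration. Transcripts whose exons overlap (directly or through a chain of other transcripts) are put in the same group, and any gene with more than one group is a candidate for duplicated rows.

```python
from collections import defaultdict

def transcript_clusters(exons):
    """Count exon-overlap clusters of transcripts per gene.

    exons: iterable of (gene_id, transcript_id, start, end), inclusive coordinates.
    Returns a dict mapping gene_id to the number of independent clusters.
    """
    parent = {}

    def find(x):
        # Union-find lookup with path halving.
        while parent[x] != x:
            parent[x] = parent[parent[x]]
            x = parent[x]
        return x

    def union(a, b):
        parent[find(a)] = find(b)

    by_gene = defaultdict(list)
    for gene_id, tx_id, start, end in exons:
        key = (gene_id, tx_id)
        parent.setdefault(key, key)
        by_gene[gene_id].append((start, end, key))

    for gene_exons in by_gene.values():
        gene_exons.sort()
        block_end, block_tx = None, None
        for start, end, key in gene_exons:
            if block_end is not None and start <= block_end:
                # Exon overlaps the current merged block: same cluster.
                union(key, block_tx)
                block_end = max(block_end, end)
            else:
                # Gap before this exon: start a new merged block.
                block_end, block_tx = end, key

    roots = defaultdict(set)
    for key in parent:
        roots[key[0]].add(find(key))
    return {gene_id: len(r) for gene_id, r in roots.items()}

# Made-up example: tx3 lies entirely inside an intron of tx1.
example = [
    ("geneA", "tx1", 100, 200),
    ("geneA", "tx1", 300, 400),
    ("geneA", "tx2", 350, 450),
    ("geneA", "tx3", 220, 280),
    ("geneB", "tx4", 10, 50),
]
result = transcript_clusters(example)
multi = sorted(g for g, n in result.items() if n > 1)
```

Here `multi` lists the genes that have more than one independent transcript cluster.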

I keep them and use the gene_id column of the Cufflinks output to make the tables unique.
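As a rough illustration of keying a table on gene_id, here is a minimal sketch. The column names (`gene_id`, `gene_short_name`, `FPKM`) and the rows are assumptions made up for the example, so check them against the header of your own Cufflinks output; `index_by_gene_id` is a hypothetical helper, not part of Cufflinks.

```python
import csv
import io

def index_by_gene_id(tsv_text, id_column="gene_id"):
    """Index tab-separated rows by id_column, failing loudly on duplicates."""
    rows = {}
    for row in csv.DictReader(io.StringIO(tsv_text), delimiter="\t"):
        key = row[id_column]
        if key in rows:
            raise ValueError(f"duplicate {id_column}: {key}")
        rows[key] = row
    return rows

# Made-up table: the gene name repeats, but gene_id does not.
sample = (
    "gene_id\tgene_short_name\tFPKM\n"
    "G1\tAbc1\t12.5\n"
    "G2\tAbc1\t3.0\n"
    "G3\tXyz2\t7.25\n"
)
table = index_by_gene_id(sample)
names = [row["gene_short_name"] for row in table.values()]
```

The name `Abc1` appears twice in `names`, but the rows stay distinct because they are stored under `G1` and `G2`.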

Philip

**emanlee** · 05-17-2014, 11:19 PM

Another thread on this issue:

http://seqanswers.com/forums/showthread.php?t=5224

A solution based on mgogol's code:

https://sourceforge.net/projects/collapsefpkm/files/?source=navbar

CollapseFPKM: this code is a solution to collapsing duplicate FPKMs for a gene.

Bug? duplicated genes in cufflinks output genes.expr