Hi All I am a rookie in RNA-seq.
I found a problem that gene ID output from Cuffdiff and egdeR/DESeq are different so that I cannot find the common DE genes.
My two pipelines:
Tophat-Cufflinks-Cuffmerge-Cuffdiff
Tophat-edgeR/DESeq
I use the data set with the Drosophila melanogaster genome.
The edgeR output gene ID (e.g. FBgn0000370,FBgn0000500,…)
But the Cuffdiff output gene ID is something like (XLOC_000028,XLOC_000038,…) and gene symbol (e.g.,KH1,RpLP1,…).
I found that the Cuffdiff 's gene ID is corresponding to merged.gtf (by Cuffmerge) but egdeR/DESeq's gene ID is corresponding to genes.gtf (by reference genome)
Could you please recommend any software or command to translate them automatically? Could you have command line for me to use to uniform or join them?
Thank you!
I found a problem that gene ID output from Cuffdiff and egdeR/DESeq are different so that I cannot find the common DE genes.
My two pipelines:
Tophat-Cufflinks-Cuffmerge-Cuffdiff
Tophat-edgeR/DESeq
I use the data set with the Drosophila melanogaster genome.
The edgeR output gene ID (e.g. FBgn0000370,FBgn0000500,…)
But the Cuffdiff output gene ID is something like (XLOC_000028,XLOC_000038,…) and gene symbol (e.g.,KH1,RpLP1,…).
I found that the Cuffdiff 's gene ID is corresponding to merged.gtf (by Cuffmerge) but egdeR/DESeq's gene ID is corresponding to genes.gtf (by reference genome)
Could you please recommend any software or command to translate them automatically? Could you have command line for me to use to uniform or join them?
Thank you!
Comment