Hi,
I was wondering if anyone can point me towards a way to combine lines of one *gtf file with another *.gtf file...? Specifically, I would like to extract certain genes from the latest Ensembl mouse *.gtf and combine them with the current UCSC RefGenes *gtf.
I thought I'd get the lines of interest by a simple 'grep "my_gene_id"' form the Ensembl file. I'm not quite sure though what the best way to concatenate / sort the merged file would be, and if there may be other problems with this (e.g. overlapping exons of Ensembl and UCSC transcripts).
Any lead on this would be great!
Thank you,
Jan
I was wondering if anyone can point me towards a way to combine lines of one *gtf file with another *.gtf file...? Specifically, I would like to extract certain genes from the latest Ensembl mouse *.gtf and combine them with the current UCSC RefGenes *gtf.
I thought I'd get the lines of interest by a simple 'grep "my_gene_id"' form the Ensembl file. I'm not quite sure though what the best way to concatenate / sort the merged file would be, and if there may be other problems with this (e.g. overlapping exons of Ensembl and UCSC transcripts).
Any lead on this would be great!
Thank you,
Jan
Comment