Hi
I have a file that has a one column like this:
ENSSSCG00000000005|ENSSSCT00000000006
ENSSSCG00000000005|ENSSSCT00000000006
ATXN10|ENSSSCT00000000009
ENSSSCG00000019685|ENSSSCT00000021280
LDOC1L|ENSSSCT00000000023
-
TSPO|ENSSSCT00000000035
ENSSSCG00000000032|ENSSSCT00000000034
ENSSSCG00000000032|ENSSSCT00000000034
TTLL1|ENSSSCT00000000037
TTLL1|ENSSSCT00000000037
TTLL1|ENSSSCT00000000037
TTLL1|ENSSSCT00000000037
How can I get rid of lines that start with ENS or those lines that after gene name has |EN.......? Actually I want to keep just gene names like in this example ATXN10,LDOC1L,
TSPO and TTLL1.
Anyone know how can I do that? Thanks for your help
I have a file that has a one column like this:
ENSSSCG00000000005|ENSSSCT00000000006
ENSSSCG00000000005|ENSSSCT00000000006
ATXN10|ENSSSCT00000000009
ENSSSCG00000019685|ENSSSCT00000021280
LDOC1L|ENSSSCT00000000023
-
TSPO|ENSSSCT00000000035
ENSSSCG00000000032|ENSSSCT00000000034
ENSSSCG00000000032|ENSSSCT00000000034
TTLL1|ENSSSCT00000000037
TTLL1|ENSSSCT00000000037
TTLL1|ENSSSCT00000000037
TTLL1|ENSSSCT00000000037
How can I get rid of lines that start with ENS or those lines that after gene name has |EN.......? Actually I want to keep just gene names like in this example ATXN10,LDOC1L,
TSPO and TTLL1.
Anyone know how can I do that? Thanks for your help
Comment