Seqanswers Leaderboard Ad

**GenoMax** · 04-16-2014, 07:09 AM

You can use the table browser from UCSC to get the data you need: http://genome.ucsc.edu/cgi-bin/hgTables?command=start

Use the right genome/build for the data. You can then use the "identifiers" button to paste in your list of id's.

**watermark** · 04-16-2014, 07:17 AM

Yes, I tried it, but I couldn't find the hits.
I use genome version hg19, human. the ID which I pase is human transcripts from TCGA.

**GenoMax** · 04-16-2014, 07:42 AM

I recollect that we had a previous discussion about this where I had posted a link to the file that had the mapping info: https://webshare.bioinf.unc.edu/publ...ownToLocus.txt

It appears that you are now looking at some isoforms that did not have corresponding name in the file. Is that correct?

**watermark** · 04-16-2014, 08:09 AM

Basically what I want to do is to convert this isoform IDs(transcript ID) to the transcript ID in the TargetScan.

More percisly I want to map the isforms IDs from TCGA to IDs like :

Transcript ID in TargetScan
NM_130786
NM_130786
NM_130786
NM_130786
NM_130786

TCGA Transcript ID:
uc001udr.2

**GenoMax** · 04-16-2014, 08:27 AM

Download the file here: http://hgdownload.soe.ucsc.edu/golde...oRefSeq.txt.gz

You should be able to grep out the id's from the file (the number at the end may be different than the one you have, I think that is the version number).

Code:

$ grep "uc001udr." knownToRefSeq.txt
uc001udr.3      NM_001243014

**watermark** · 04-16-2014, 10:01 AM

Thanks Genomax, it was helpful.

**watermark** · 04-16-2014, 10:12 AM

what is the knownToRefSeq ? it's basically are all transcript(isoforms) which are known ? so I will miss lots of the isforms when I'm converitng ?

Originally posted by GenoMax View Post

Download the file here: http://hgdownload.soe.ucsc.edu/golde...oRefSeq.txt.gz

You should be able to grep out the id's from the file (the number at the end may be different than the one you have, I think that is the version number).

Code:

$ grep "uc001udr." knownToRefSeq.txt
uc001udr.3      NM_001243014

**GenoMax** · 04-16-2014, 11:16 AM

Originally posted by watermark View Post

what is the knownToRefSeq ? it's basically are all transcript(isoforms) which are known ? so I will miss lots of the isforms when I'm converitng ?

knownToRefSeq will relate the NM_* and NR_* names to UCSC gene names uc0*

See Hiram Clawson's (UCSC support) answer in this thread: http://redmine.soe.ucsc.edu/forum/in...01fe5506a9f16c

Topics	Statistics	Last Post
Genetic Variants and Diabetes Risk in Childhood Cancer Survivors by seqadmin Started by seqadmin, Yesterday, 08:47 AM	0 responses 16 views 0 likes	Last Post by seqadmin Yesterday, 08:47 AM
Cancer Metastasis: A Deep Dive into Cellular Plasticity by seqadmin Started by seqadmin, 04-11-2024, 12:08 PM	0 responses 60 views 0 likes	Last Post by seqadmin 04-11-2024, 12:08 PM
Proteogenomic Profiles Offer New Clues in Prostate Cancer by seqadmin Started by seqadmin, 04-10-2024, 10:19 PM	0 responses 60 views 0 likes	Last Post by seqadmin 04-10-2024, 10:19 PM
Novel Diagnostic Assay Enhances Ovarian Cancer Detection by seqadmin Started by seqadmin, 04-10-2024, 09:21 AM	0 responses 54 views 0 likes	Last Post by seqadmin 04-10-2024, 09:21 AM

Seqanswers Leaderboard Ad

Announcement

Trouble in converting UCSC transcripts IDs

Comment

Comment

Comment

Comment

Comment

Comment

Comment

Comment

Latest Articles

ad_right_rmr

News