Dear All,
I have a list of RefSeq IDs. Each ID consists of a pair of IDs (protein ID;mRNA ID) for example (NP_001135412.1; NM_001141940.1). I retrieved this list from Uniprot for different organisms (mainly mammalians).
My question is:
I want to get the promoter sequence of (+1000,-200 nt) starting at the gene start position (TSS) for each portein ID in my list. How can I do that?
Will I need to download all genomes of these organisms and get the TSS positions for all my RefSeq IDs then extract the promoters from the genomes? Any better solution?
Thanks in advance.
I have a list of RefSeq IDs. Each ID consists of a pair of IDs (protein ID;mRNA ID) for example (NP_001135412.1; NM_001141940.1). I retrieved this list from Uniprot for different organisms (mainly mammalians).
My question is:
I want to get the promoter sequence of (+1000,-200 nt) starting at the gene start position (TSS) for each portein ID in my list. How can I do that?
Will I need to download all genomes of these organisms and get the TSS positions for all my RefSeq IDs then extract the promoters from the genomes? Any better solution?
Thanks in advance.
Comment