A denovo assembly of a new organism that lives in the sea (from the animal kingdom but we do not know its type specifically) have been prepared.
We want now to identify all possible transcription factors in this genome. So, I was thinking in the following steps:
1) Send the translated sequences (amino acid sequences of the genome) to Blast to identify gene/protein locations/sequences in the genome
2) Prepare list of TFs from multiple sources for different organisms (e.g. all TF Sequences of TFs in TRANSFAC, Human TFs...etc)
3) Use ORTHOMCL to identify which of TFs in step 2 is close (in the same cluster) to my proteins that were identified in step 1.
What do you think? is there any other suggestions?
We want now to identify all possible transcription factors in this genome. So, I was thinking in the following steps:
1) Send the translated sequences (amino acid sequences of the genome) to Blast to identify gene/protein locations/sequences in the genome
2) Prepare list of TFs from multiple sources for different organisms (e.g. all TF Sequences of TFs in TRANSFAC, Human TFs...etc)
3) Use ORTHOMCL to identify which of TFs in step 2 is close (in the same cluster) to my proteins that were identified in step 1.
What do you think? is there any other suggestions?
Comment