Hey,
Hopefully this question is appropriate for this thread (Edited)
1) I need to compute amino acid substitution mutation frequency in a gene and I want to normalize these frequencies across all genes by dividing these frequencies with the respective lengths of the genes.
E.g. (A1T frequency in TP53 frequency) = (# of samples)/1254
My question is where do I find the size of all genes?
2) If I need to compute frequencies of gene mutations in general (containing not just non-synonymous but also insertions and deletion). How would I then normalize the mutation frequency for each gene based on the size? What size would I consider (gene chr position end - gene chr position start?)
Thanks.
Hopefully this question is appropriate for this thread (Edited)
1) I need to compute amino acid substitution mutation frequency in a gene and I want to normalize these frequencies across all genes by dividing these frequencies with the respective lengths of the genes.
E.g. (A1T frequency in TP53 frequency) = (# of samples)/1254
My question is where do I find the size of all genes?
2) If I need to compute frequencies of gene mutations in general (containing not just non-synonymous but also insertions and deletion). How would I then normalize the mutation frequency for each gene based on the size? What size would I consider (gene chr position end - gene chr position start?)
Thanks.
Comment