Hi,
I am using the package EdgeR for calculating differential expression between genes. However I'm a bit of rookie in statistics and have a few questions regarding some concepts. I'd greatly appreciate your help in getting my head around these concepts. My questions are as follows:
1) What does Common Dispersion really mean?
Context: I have 2 replicates of my sample and a regular pairwise correlation suggests a coefficient of 0.98 between the replicates. However, when I run the estimateCommonDisp() function in EdgeR it gives me 0.035 as an estimate of common dispersion. The manual and some online posts seem to suggest that low estimate indicates high variability between replicates. Is that true or am I being mistaken?
2) Is it always better to use moderated tagwise dispersion (as it seems more customized to each tag than a general one-size-fits-all). When is it better to use tagwise over only common dispersion? Also, when you do smoothing, what do you really do?
3) How does the prior.n value affect the results?
Context: My prior.n value post smoothing is 0.0002284025. The manual asks prior.n=10 as default.It also says that lower the value, the more highly variable tags will be heavily penalized and there will be less "squeezing" of tagwise dispersions towards a common value. Is that good or bad? Should I use the value obtained with my dataset or some value between 10 and 40 as suggested?
Thanks in advance!
I am using the package EdgeR for calculating differential expression between genes. However I'm a bit of rookie in statistics and have a few questions regarding some concepts. I'd greatly appreciate your help in getting my head around these concepts. My questions are as follows:
1) What does Common Dispersion really mean?
Context: I have 2 replicates of my sample and a regular pairwise correlation suggests a coefficient of 0.98 between the replicates. However, when I run the estimateCommonDisp() function in EdgeR it gives me 0.035 as an estimate of common dispersion. The manual and some online posts seem to suggest that low estimate indicates high variability between replicates. Is that true or am I being mistaken?
2) Is it always better to use moderated tagwise dispersion (as it seems more customized to each tag than a general one-size-fits-all). When is it better to use tagwise over only common dispersion? Also, when you do smoothing, what do you really do?
3) How does the prior.n value affect the results?
Context: My prior.n value post smoothing is 0.0002284025. The manual asks prior.n=10 as default.It also says that lower the value, the more highly variable tags will be heavily penalized and there will be less "squeezing" of tagwise dispersions towards a common value. Is that good or bad? Should I use the value obtained with my dataset or some value between 10 and 40 as suggested?
Thanks in advance!
Comment