  • edgeR mean normalized reads counts for each gene

    Hi,
    I am trying to analyse my RNA-seq data using edgeR (4 conditions: VA, PA, BP, NP, with 3 replicates each). The “counts” matrix looks like the following in R:

    row.names     VA1  VA2  VA3  PA1  PA2  PA3  BP1  BP2  BP3  NP1  NP2  NP3
    mp090_00001     1   16   10    2    1    1    1    2    2    3    0    4
    mp090_00002   289  433  512  130  253  313  193  267  317  480  595  888
    mp090_00003   721  566  702  220  360  474  299  512  647 2147 1769 2601
    mp090_00004   921  716  724   41   56   77   62   84   97  588  528 1175
    mp090_00005     1    2    2    1    1    1    0    1    0    1    0    0



    As a first step, I would like to calculate normalized read counts for each gene in each of the 12 (4×3) replicates, relative to the total number of mapped reads in that replicate. Then I am thinking of taking the mean over the three replicates, so that I would end up with normalized read counts per condition for each gene, instead of the raw read counts currently in the “counts” matrix above.
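    Just to illustrate what I mean in plain R (this is only a rough sketch of the idea; the per-million scaling and the grouping vector are my own assumption, and counts is the numeric count matrix above with gene IDs as row names):

    Code:
    lib.sizes <- colSums(counts)                       # total mapped reads in each replicate
    rpm <- sweep(counts, 2, lib.sizes, "/") * 1e6      # scale each replicate to reads per million
    group <- rep(c("VA", "PA", "BP", "NP"), each = 3)  # same column order as the matrix above
    cond.means <- sapply(unique(group), function(g)
        rowMeans(rpm[, group == g, drop = FALSE]))     # mean normalized count per gene per condition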
    To do that, I ran the following code in R:

    > library(edgeR)
    > group <- c(rep("VA",3), rep("PA",3), rep("BP",3), rep("NP",3))
    > d <- DGEList(counts, group=group, lib.size=colSums(counts))
    > dnorm <- calcNormFactors(d)
    > dnorm$samples
    group lib.size norm.factors
    VA1 VA 6974289 0.8485103
    VA2 VA 6976361 0.8982784
    VA3 VA 7817591 0.8927855
    PA1 PA 1687274 1.2081254
    PA2 PA 2639866 1.1679706
    PA3 PA 3453869 1.1371379
    BP1 BP 2578638 1.0063343
    BP2 BP 3384353 1.0768956
    BP3 BP 4056560 1.0409395
    NP1 NP 9047082 0.9355826
    NP2 NP 7851071 0.9358234
    NP3 NP 12021020 0.9272787

    > dnorm$samples$lib.size*dnorm$samples$norm.factors # calculate the effective library sizes
    [1] 5917756 6266714 6979432 2038439 3083286 3927525 2594972 3644595 4222634 8464293 7347216 11146836
    > lin.norm.factors <- (sum(d$samples$lib.size)/nrow(d$samples)) / d$samples$lib.size # mean library size divided by each library size
    > lin.norm.factors
    [1] 0.8183388 0.8180957 0.7300626 3.3825752 2.1619776 1.6524458 2.2133123 1.6863877 1.4069387 0.6308477 0.7269494 0.4747793

    I don’t really know whether the norm.factors I am looking for are the ones shown above, or what exactly the difference is between them and the lin.norm.factors calculated at the end. I am also wondering how exactly I should use these normalization factors in order to obtain the normalized read counts per gene per condition. I found that one option is to do the following, but I don’t understand why you have to go through this:

    f <- calcNormFactors(counts)          # per-library scaling factors (TMM)
    f <- f/exp(mean(log(f)))              # rescale the factors to have geometric mean 1
    d <- DGEList(counts, group=group, lib.size=colSums(counts) * f)  # effective library sizes

    or to calculate d$pseudo.alt and save the relevant table, but that does not work in my case.
    Any help on what exactly I have to do would be greatly appreciated, as I am new to edgeR and am struggling to find my way out!
    Thanks in advance
    Last edited by antoza; 02-10-2014, 08:31 AM.

  • #2
    It's likely easier to just use cpm():

    Code:
    library(edgeR)
    group <- c(rep("VA",3),rep("PA",3),rep("BP",3), rep("NP",3))
    d <- DGEList(counts, group=group)
    d <- calcNormFactors(d)
    norm.counts <- cpm(d)  # cpm() already returns counts per million, so no extra scaling is needed
    Or something close to that.
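    If you then want a single normalized value per gene per condition (the mean over the three replicates), something along these lines should work (a quick sketch, assuming the column order above):

    Code:
    cond.means <- sapply(unique(group), function(g)
        rowMeans(norm.counts[, group == g, drop = FALSE]))
    head(cond.means)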



    • #3
      Thanks dpryan for your reply,

      I have used your suggestion and ran the code below without facing any problems:

      > library(edgeR)
      > group <- c(rep("VA",3), rep("PA",3), rep("BP",3), rep("NP",3))
      > d <- DGEList(counts=countsTable, group=group, lib.size=colSums(countsTable))
      > f <- calcNormFactors(countsTable)
      > f <- f/exp(mean(log(f)))
      > d <- DGEList(counts=countsTable, group=group, lib.size=colSums(countsTable) * f)
      > pair <- c("VA", "PA")
      > d1 <- estimateCommonDisp(d)
      > f
      [1] 0.8485103 0.8982784 0.8927855 1.2081254 1.1679706 1.1371379 1.0063343 1.0768956 1.0409395 0.9355826 0.9358234 0.9272787 # I guess these are the normalization factors, one for each of my 12 samples (right?)
      > norm.counts <- cpm(d)
      #########################
      However, I still have the following inquiries:
      1. When I look at the components of the object returned by estimateCommonDisp (d1), among others there are the $pseudo.counts for all genes across the 12 samples (what exactly do these numbers mean, and how do they differ from the normalized counts obtained with your suggestion?). There is also a $pseudo.lib.size ([1] 4867506) that I don’t quite understand. I also found that the following code
      cpm <- 1e06*t(t(d$counts) / (d$samples$lib.size*d$samples$norm.factors))
      might give similar results to the syntax you gave me. I am not quite sure whether the output of one or the other would be the normalised counts per gene, or whether that is what the pseudo.counts mentioned above are.

      2. I have also found that you can calculate normalized counts by running the following code, which gave results different from both the pseudo.counts and the norm.counts (cpm) results. I am quite confused about the difference between these per-gene, per-sample count matrices. Could you please clarify this further?
      > scale <- d$samples$lib.size*d$samples$norm.factors
      > normCounts <- round(t(t(countsTable)/scale)*mean(scale))

      3. Finally, my common dispersion is 0.05073274. Does that mean that, in general, I have a low coefficient of variation among my 3 replicates for all 4 conditions? (I am not fully sure about the biological meaning of the common dispersion.)
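      If I understand the edgeR documentation correctly, the biological coefficient of variation is the square root of the dispersion, so in my case (assuming d1$common.dispersion is the right slot to look at):

      Code:
      sqrt(d1$common.dispersion)  # sqrt(0.05073274) is about 0.225, i.e. a BCV of roughly 22-23%

      but please correct me if I am reading this wrong.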

      Thank you so much……
