**dpryan** · 06-15-2015, 02:14 AM

Perhaps an example is the simplest approach:

Code:

design = ~age+gender

"age" and "gender" would then be different "r"s. In the case of "age", there may in fact be multiple "r"s, if for example age is a factor with more than two levels (e.g., "young", "middleAged" and "old"). So yes, it's saying that the coefficients are distributed about 0.
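To see how one factor can contribute several "r"s, here is a minimal sketch in base R. The sample values and the variable names (`age`, `gender`, `design_matrix`) are made up for illustration, and this is the standard (non-expanded) model matrix, not necessarily the one DESeq2 builds internally.

```r
# Hypothetical sample annotations: a three-level and a two-level factor.
age    <- factor(c("young", "middleAged", "old", "young", "middleAged", "old"))
gender <- factor(c("F", "M", "M", "F", "F", "M"))

# Standard treatment-coded design matrix for ~ age + gender.
design_matrix <- model.matrix(~ age + gender)

# Each column is one coefficient, i.e. one "r".
print(colnames(design_matrix))
```

With treatment coding, the three-level `age` factor yields two coefficient columns (one per non-base level) alongside the intercept, and `gender` yields one.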

**tirohia** · 06-15-2015, 04:36 PM

Okay. That makes sense. Thanks. A further question if I may though.

The definition of βir provided is the LFC for gene i, covariate r. Does this mean that βir is the LFC between the replicates within a given r or is it between two r's?

If it's the former, isn't that what the NB distribution was being fitted on in eqn(1)/(2)? In which case, if I have 3 replicates, the variance between those three replicates fits a NB but the LFCs between the 3 are normally distributed?

If it's the latter, which two r's (I have multiple)? In which case, is the assumption in eqn(10) that the LFCs of all possible comparisons of r's are normally distributed?

It feels like there's a conceptual thing here that I'm not getting.

Cheers
Ben.

**dpryan** · 06-15-2015, 11:26 PM

Well, it's the LFC due to that particular coefficient. Whether it's versus the mean across samples or a traditional intercept (i.e., one of the samples) will depend on whether the expanded model matrix (something particular to DESeq2) is being used.
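For orientation, the model under discussion can be sketched as below. This is the DESeq2 GLM as it is commonly stated, with the usual symbols assumed: counts K_ij for gene i and sample j, size factors s_j, dispersion α_i, design matrix entries x_jr, and prior width σ_r. The equation numbers quoted in the thread are not reproduced here.

```latex
K_{ij} \sim \mathrm{NB}(\mu_{ij}, \alpha_i), \qquad
\mu_{ij} = s_j \, q_{ij}, \qquad
\log_2 q_{ij} = \sum_r x_{jr} \, \beta_{ir}, \qquad
\beta_{ir} \sim \mathcal{N}(0, \sigma_r^2)
```

The two distributions apply to different quantities: the negative binomial describes the counts around the fitted mean, while the zero-centered normal is a prior on each coefficient β_ir of the linear predictor.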

In the traditional method (i.e., with no expanded model matrix), R will select the alphabetically first level of a factor as the base level for further comparisons, so choose that wisely. This is actually one of the clever things about DESeq2, since the expanded model matrix allows shrinkage with a prior while keeping the log2FC for a contrast the same regardless of which base level you choose for a factor. You might read "help(nbinomWaldTest)" for some more information.
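A small base R sketch of the base-level behaviour follows. The factor values and the names `age` and `age_releveled` are hypothetical; no DESeq2 functions are called here.

```r
# Hypothetical three-level factor; R orders the levels alphabetically by default.
age <- factor(c("young", "middleAged", "old"))
print(levels(age))            # "middleAged" comes first, so it is the base level

# Choose the base level explicitly instead of relying on alphabetical order.
age_releveled <- relevel(age, ref = "young")
print(levels(age_releveled))  # "young" is now first

# The coefficient columns are now comparisons against "young".
print(colnames(model.matrix(~ age_releveled)))
```

Setting the reference level explicitly before building the design keeps the meaning of each coefficient independent of how the level names happen to sort.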

**tirohia** · 06-16-2015, 06:04 PM

help(nbinomWaldTest) appears to be a much more succinct summary of the paper. It helps, thanks.

Now that you mention it, I do recall the DESeq2 manual saying somewhere that it uses the first level as the base level. I appear to have completely failed to connect the dots betwixt the practical instructions in the manual and the theory in the paper.

Many thanks.
Ben.

Question about DESeq2 LFC shrinkage estimation.

Comment

Comment

Comment

Comment

Latest Articles

ad_right_rmr

News