![]() |
|
![]() |
||||
Thread | Thread Starter | Forum | Replies | Last Post |
DESEQ2 Desing formula time series experiments with sham controls. | adroval | RNA Sequencing | 0 | 01-26-2018 12:39 PM |
Help with complicated DESeq2 design formula | aroomacanvas | Bioinformatics | 0 | 12-01-2015 10:35 AM |
DESeq2 - paired formula design with replicates | bryand | RNA Sequencing | 3 | 07-24-2015 03:22 AM |
DESeq2 model matrix formula | dmosk | Bioinformatics | 35 | 01-05-2015 03:42 PM |
Paired design versus unpaired design in DESeq2 | KristenC | RNA Sequencing | 1 | 05-29-2014 12:05 PM |
![]() |
|
Thread Tools |
![]() |
#1 | |
Junior Member
Location: Italy Join Date: May 2019
Posts: 1
|
![]()
Hi,
I am trying to understand the role of using an interaction term in the design formula of DESeq2. I have read this explanation: http://bioconductor.org/packages/dev...l#interactions This contains the following paragraph: Quote:
I would be happy if someone can confirm these affirmations to know if I understand this correctly: 1) = ~condition + genotype + condition:genotype This is not looking at differential expression between conditions, typically a WT vs KO. This is in fact detecting the genes that are differentially expressed between conditions AND differently between genotypes. 2) = ~ condition + genotype This is detecting differentially expressed genes correcting for the genotype effect. In other words, this is looking at differentially expressed genes between all the samples of condition A and all the samples of condition B, but correcting for the effect of the genotype (like we can correct for the batch effect). 3) =~condition Same as above but not correcting for the genotype effect. I would like also to know if the following statement is correct: If now considering batches instead of genotypes, if one uses a package for batch effect correction such as sva, we can say that: 1) (~condition + USAGE OF SVA) is equivalent, in the principle, to (~condition + batch). The difference is that a particular package will use a different method. Question: If the above statements are true, is it correct to say that the following code is equivalent to a 2 by 2 comparision in each genotype using only ~condition: `results(dds, contrast=c("group", "IB", "IA")) results(dds, contrast=c("group", "IIB", "IIA")) results(dds, contrast=c("group", "IIIB", "IIIA"))` or is it only subselecting genes that are different between all genotypes AND different between conditions for genotype X (X=c("I", "II", "III"))? Thanks a lot in advance. |
|
![]() |
![]() |
![]() |
#2 |
Senior Member
Location: Heidelberg, Germany Join Date: Aug 2009
Posts: 109
|
![]()
Hi Nicolas
Assertions 2-4 seem OK, but 1 is not correct. The best I could come up with to explain this is in the recent book: https://www.huber.embl.de/msmb/Chap-...ec:multifactor In particular, note that model formulae are not detecting any genes. They are a concise way of specifying a model with multiple parameters ("betas"), and the next step is saying which particular one of these parameters, or linear combination of them ("contrasts") you care about, and *then* you look for genes with a large value of this (univariate) parameter. Sorry, I didn't understand the "Question". Hope this helps (a little) - Wolfgang
__________________
Wolfgang Huber EMBL Last edited by Wolfgang Huber; 05-19-2019 at 01:12 PM. |
![]() |
![]() |
![]() |
Tags |
deseq2 |
Thread Tools | |
|
|