Unconfigured Ad

**dpryan** · 12-14-2015, 11:11 AM

You can simply exclude the patients for whom you only have a single sample. They'll get ignored in the analysis anyway. Anyway, yes your model is correct and you do indeed care most about the "visit:treatment" term.

**andrewelamb** · 12-15-2015, 11:57 AM

Thanks for the answer!

I did get this error however:

Error in checkFullRank(modelMatrix) :
the model matrix is not full rank, so the model cannot be fit as specified.
One or more variables or interaction terms in the design formula are linear
combinations of the others and must be removed.

my pheno file looks like:

sampleName visit condition patient
1 V2 control 1
2 V5 control 1
3 V2 treatment 2
4 V5 treatment 2
5 V5 treatment 3
6 V2 treatment 3
7 V5 treatment 4
8 V2 treatment 4
9 V2 control 5
10 V5 control 5

Removing patients from the experimental design worked. Is there any way, or value, to preserve the patient data?

**dpryan** · 12-16-2015, 12:42 AM

Indeed, I should have foreseen that :P

If you were to instead use "~patient+condition:visit+visit" and got rid of the "conditiontreatment:visitV2" column in the model matrix then the result would work. The original problem was that each condition is comprised of a set of patients, so you can't have patient coefficients and a "condition" coefficient (which is just the average of the patient coefficients!).

Sorry that that's so confusing.

**andrewelamb** · 12-16-2015, 06:30 AM

Thank you for the help!

I apologize, I'm not entirely clear on how to set up my model matrix based on your answer. It seems I would still need every column if I were to use "~patient+condition:visit+visit".

**dpryan** · 12-16-2015, 06:35 AM

I had a typo in my reply, I meant to remove the "conditiontreatment:visitV2" from the model matrix. That'll make it full rank,

**andrewelamb** · 12-16-2015, 07:31 AM

Ahh I see, I'm getting my sample table and the model matrix confused.

So is this the correct way to use my own model matrix?

design_string <- "~patient+condition:visit+visit"
sample_table <- read.table(input_file, row.names = NULL, header = T, sep = ",")
deseq_object <- DESeqDataSetFromHTSeqCount(sampleTable = sample_table,
design = ~condition, #have to have something here
directory = count_folder)
mm <- model.matrix(as.formula(design_string), sample_table)
mm <- mm[,-19] # gets rid of conditiontreatment:visitV2
deseq_object <- DESeq(deseq_object, full=mm, betaPrior=FALSE)

**dpryan** · 12-16-2015, 11:51 AM

Something along those lines at least.

Topics	Statistics	Last Post
Long-Read RNA Sequencing Uncovers a Hidden Layer of Immune Cell Regulation by SEQadmin2 Started by SEQadmin2, 06-02-2026, 12:03 PM	0 responses 19 views 0 reactions	Last Post by SEQadmin2 06-02-2026, 12:03 PM
DNA Methylation Study Reveals How Epigenetic Changes Pass Between Generations by SEQadmin2 Started by SEQadmin2, 06-02-2026, 11:40 AM	0 responses 14 views 0 reactions	Last Post by SEQadmin2 06-02-2026, 11:40 AM
MetaBeeAI Helps Scientists Process Research Literature Faster by SEQadmin2 Started by SEQadmin2, 05-28-2026, 11:40 AM	0 responses 29 views 0 reactions	Last Post by SEQadmin2 05-28-2026, 11:40 AM
Scientists Solve a 25-Year Mystery in RNA Interference by SEQadmin2 Started by SEQadmin2, 05-26-2026, 10:12 AM	0 responses 31 views 0 reactions	Last Post by SEQadmin2 05-26-2026, 10:12 AM

Unconfigured Ad

eEtting up DESeq 2 analysis

Comment

Comment

Comment

Comment

Comment

Comment

Comment

Latest Articles

ad_right_rmr

News