**dpryan** · 12-14-2015, 11:11 AM

You can simply exclude the patients for whom you only have a single sample; they would be ignored in the analysis anyway. And yes, your model is correct, and you do indeed care most about the "visit:treatment" term.
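
For what it's worth, dropping the single-sample patients can be done in base R before the data set is built. This is only a rough sketch: the `sample_table` below is a made-up toy table (not the real pheno file), and `paired_table` is a hypothetical name.

```r
# Hypothetical toy sample table: patient 3 has only one visit.
sample_table <- data.frame(
  sampleName = 1:5,
  visit      = c("V2", "V5", "V2", "V5", "V2"),
  patient    = c(1, 1, 2, 2, 3)
)

# Count samples per patient and keep only patients with at least two.
n_per_patient <- table(sample_table$patient)
paired_ids    <- names(n_per_patient)[n_per_patient >= 2]
paired_table  <- sample_table[as.character(sample_table$patient) %in% paired_ids, ]
```

The same samples would also need to be left out of the count data that goes with the table.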

**andrewelamb** · 12-15-2015, 11:57 AM

Thanks for the answer!

I did get this error however:

Error in checkFullRank(modelMatrix) :
the model matrix is not full rank, so the model cannot be fit as specified.
One or more variables or interaction terms in the design formula are linear
combinations of the others and must be removed.

My pheno file looks like:

sampleName visit condition patient
1 V2 control 1
2 V5 control 1
3 V2 treatment 2
4 V5 treatment 2
5 V5 treatment 3
6 V2 treatment 3
7 V5 treatment 4
8 V2 treatment 4
9 V2 control 5
10 V5 control 5

Removing patient from the design formula worked. Is there any way to keep the patient information in the model, and is there any value in doing so?

**dpryan** · 12-16-2015, 12:42 AM

Indeed, I should have foreseen that :P

If you instead use "~patient+condition:visit+visit" and remove the "conditiontreatment:visitV2" column from the model matrix, then the fit will work. The original problem was that each condition is made up of a set of patients, so you can't have both patient coefficients and a "condition" coefficient (which is just the average of the patient coefficients!).

Sorry that that's so confusing.
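
To make the rank issue a bit more concrete, here is a minimal base-R sketch using the ten rows of the pheno table posted above. It assumes all three columns are treated as factors (an assumption about how the real file is read), and the variable names (`pheno`, `mm_reduced`, and so on) are only for illustration. It checks the rank of the model matrix with `qr()` before and after dropping the redundant column.

```r
# Pheno table copied from the thread; all columns are made factors explicitly
# (an assumption, since the real file may be parsed differently).
pheno <- data.frame(
  visit     = factor(c("V2", "V5", "V2", "V5", "V5", "V2", "V5", "V2", "V2", "V5")),
  condition = factor(c("control", "control", "treatment", "treatment", "treatment",
                       "treatment", "treatment", "treatment", "control", "control")),
  patient   = factor(c(1, 1, 2, 2, 3, 3, 4, 4, 5, 5))
)

# Patient blocks, a visit effect, and a condition-specific visit effect.
mm <- model.matrix(~ patient + condition:visit + visit, pheno)

# The two interaction columns add up to the "treatment" indicator, which is
# already the sum of the treated patients' columns, so one column is redundant.
ncol_before <- ncol(mm)
rank_before <- qr(mm)$rank

# Drop the redundant column by name rather than by position.
mm_reduced <- mm[, colnames(mm) != "conditiontreatment:visitV2"]
rank_after <- qr(mm_reduced)$rank
```

If `rank_before` is smaller than `ncol_before`, the matrix is not full rank; after the column is dropped, `rank_after` should equal `ncol(mm_reduced)`.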

**andrewelamb** · 12-16-2015, 06:30 AM

Thank you for the help!

I apologize, I'm not entirely clear on how to set up my model matrix based on your answer. It seems I would still need every column if I were to use "~patient+condition:visit+visit".

**dpryan** · 12-16-2015, 06:35 AM

I had a typo in my reply; I meant to remove the "conditiontreatment:visitV2" column from the model matrix. That'll make it full rank.

**andrewelamb** · 12-16-2015, 07:31 AM

Ahh I see, I'm getting my sample table and the model matrix confused.

So is this the correct way to use my own model matrix?

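library(DESeq2)
design_string <- "~patient+condition:visit+visit"
sample_table <- read.table(input_file, row.names = NULL, header = TRUE, sep = ",")
sample_table$patient <- factor(sample_table$patient) # patient IDs are labels, not numbers
deseq_object <- DESeqDataSetFromHTSeqCount(sampleTable = sample_table,
                                           design = ~condition, # have to have something here
                                           directory = count_folder)
# build the model matrix from the same sample table
mm <- model.matrix(as.formula(design_string), sample_table)
# get rid of conditiontreatment:visitV2 by name rather than by column index
mm <- mm[, colnames(mm) != "conditiontreatment:visitV2"]
# a user-supplied model matrix requires betaPrior = FALSE
deseq_object <- DESeq(deseq_object, full = mm, betaPrior = FALSE)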

**dpryan** · 12-16-2015, 11:51 AM

Something along those lines at least.

Setting up DESeq 2 analysis