SEQanswers

Go Back   SEQanswers > Bioinformatics > Bioinformatics



Similar Threads
Thread Thread Starter Forum Replies Last Post
time course multi-factor design with DESeq2 Neytiri Bioinformatics 0 05-04-2017 06:45 AM
interactions with DESeq2 in a time-course analysis frymor Bioinformatics 4 11-12-2015 03:19 AM
DESeq2 Circadian Time Series without time 0 jaw_bio RNA Sequencing 2 09-18-2015 12:29 AM
DESeq2 question in dealing with time course data. liuse Bioinformatics 12 03-19-2015 03:10 AM
time course,multi groups with DESeq2 jutos Bioinformatics 7 01-20-2015 11:04 AM

Reply
 
Thread Tools
Old 08-06-2020, 06:10 AM   #1
gugpuccio
Junior Member
 
Location: Italy

Join Date: Aug 2020
Posts: 1
Default Deseq2 asymmetric time-course

Hi everyone, i have an asymmetrical time-course experiment. Timepoints are not equal for each "treatment" or "condition". In fact the T0 samples are common for all the conditions (AFT0R2, AFT0R3, AP_T0R1) and timepoints then varies for each condition. AP has two timepoints (T1 and T2), AF has three timepoints (T1, T2, T3) and UF has only one (T1). The coldata looks like this:

Code:
Sample Condition Time Replicate
AF_T0R2 CP 0 2
AF_T0R3 CP 0 3
AP_T0R1 CP 0 1
AF_T1R1 AF 1 1
AF_T1R2 AF 1 2
AF_T1R3 AF 1 3
AF_T2R1 AF 2 1
AF_T2R2 AF 2 2
AF_T2R3 AF 2 3
AF_T3R1 AF 3 1
AF_T3R2 AF 3 2
AF_T3R3 AF 3 3
AP_T1R1 AP 1 1
AP_T1R2 AP 1 2
AP_T1R3 AP 1 3
AP_T2R1 AP 2 1
AP_T2R2 AP 2 2
AP_T2R3 AP 2 3
UF_T1R1 UF 1 1
UF_T1R2 UF 1 2
UF_T1R3 UF 1 3
What i've done until now is subsetting the dataset to obtain symmetrical timepoints to perform a time-course analysis using this coldata:

Code:
Sample Condition Time Replicate
AF_T0R2_2 AF 0 2
AF_T0R3_2 AF 0 3
AP_T0R1_2 AF 0 1
AF_T1R1 AF 1 1
AF_T1R2 AF 1 2
AF_T1R3 AF 1 3
AF_T3R1 AF 2 1
AF_T3R2 AF 2 2
AF_T3R3 AF 2 3
AF_T0R2 AP 0 2
AF_T0R3 AP 0 3
AP_T0R1 AP 0 1
AP_T1R1 AP 1 1
AP_T1R2 AP 1 2
AP_T1R3 AP 1 3
AP_T2R1 AP 2 1
AP_T2R2 AP 2 2
AP_T2R3 AP 2 3
The main problem with this approach is that some samples and conditions are left out of the analysis and, more importantly, i had to duplicate the T0 samples that are common to both conditions using the 2 (AFT0R22, AFT0R2 etc). Using this approach i obtained fairly good results using:

Code:
ddsTC <- DESeqDataSetFromMatrix(countData = countdata, colData =    coldata, 
                                   design =  ~ Condition + Time + Condition:Time)

ddsTC <- DESeq(ddsTC, test="LRT", reduced = ~ Condition + Time)
Another approach would be to make single LRT tests for each condition using ~ Time and a reduced formula of ~ 1. This way i would obtain all DEGs that changes significantly during time (even with only 2 timepoints) for each condition. Is this correct? Then i should be able to do some clustering and find intersections etc. If so can you help me with this kind of analysis? How can i exclude samples from the coldata to perform the LRT test only on, for example, the AF condition? I would like to perform an LRT for each condition using Time as factor. Something like:

Code:
Sample Condition Time Replicate UF AF AP
AF_T0R2 CP 0 2 T0 T0 T0
AF_T0R3 CP 0 3 T0 T0 T0
AF_T1R1 AF 1 1  T1 
AF_T1R2 AF 1 2  T1 
AF_T1R3 AF 1 3  T1 
AF_T2R1 AF 2 1  T2 
AF_T2R2 AF 2 2  T2 
AF_T2R3 AF 2 3  T2 
AF_T3R1 AF 3 1  T3 
AF_T3R2 AF 3 2  T3 
AF_T3R3 AF 3 3  T3 
AP_T0R1 CP 0 1 T0 T0 T0
AP_T1R1 AP 1 1   T1
AP_T1R2 AP 1 2   T1
AP_T1R3 AP 1 3   T1
AP_T2R1 AP 2 1   T2
AP_T2R2 AP 2 2   T2
AP_T2R3 AP 2 3   T2
UF_T1R1 UF 1 1 T1  
UF_T1R2 UF 1 2 T1  
UF_T1R3 UF 1 3 T1
and using:

Code:
dds1 <- DESeqDataSetFromMatrix(countData = countdata, colData = coldata, 
                                design =  ~ AP)
dds1 <- DESeq(dds1 test = "LRT", reduced = ~ 1)

dds2 <- DESeqDataSetFromMatrix(countData = countdata, colData = coldata, 
                                design =  ~ AF)
dds2 <- DESeq(dds2, test = "LRT", reduced = ~ 1)

dds3 <- DESeqDataSetFromMatrix(countData = countdata, colData = coldata, 
                                design =  ~ UF)  
dds3 <- DESeq(dds3, test = "LRT", reduced = ~ 1)
leaving some of the rows empty is not working, and it is using an empty factor. I was able to obtain DEGs for each comparison using subset of the dataset, but this way the sizefactors are different for each dataset. Probably i could use estimatesizefactors for the complete dataset and then use it for the subsets. Do you think this could work? Could you provide an example of code for doing it?

I hope i was clear enough. Thank you for the help!

Guglielmo
gugpuccio is offline   Reply With Quote
Reply

Thread Tools

Posting Rules
You may not post new threads
You may not post replies
You may not post attachments
You may not edit your posts

BB code is On
Smilies are On
[IMG] code is On
HTML code is Off




All times are GMT -8. The time now is 01:21 AM.


Powered by vBulletin® Version 3.8.9
Copyright ©2000 - 2020, vBulletin Solutions, Inc.
Single Sign On provided by vBSSO