Seqanswers Leaderboard Ad

**rozitaa** · 03-30-2015, 06:51 AM

Originally posted by Michael Love View Post

hi rozitaa,

The issue here is that there are more animals in certain tissues. So when the R function model.matrix tries to form the model matrix internal to the DESeq() function, this results in a column of zeros for the nested animal 6, tissue A combination for example.

There is an easy solution though. In the next release of Bioconductor, which will be released 17 April 2015, and has DESeq2 v1.8, you can provide a custom model matrix to DESeq(). (You can test the development version of DESeq2 v1.7 already, but this requires downloading the development version of R, and this is not impossible but not always easy.)

So all you need to do is to remove these interaction columns which have no samples:

Code:

design <- ~ gut_microbiota + gut_microbiota:animal.nested + gut_microbiota:tissue
mm <- model.matix(design, data=colData(dds))
## remove columns of mm which have all zeros:
(all.zero <- apply(mm, 2, function(x) all(x == 0)))
mm <- mm[, !all.zero]
dds <- DESeq(dds, design=mm, betaPrior=FALSE)

We do have to turn off the log2 fold change shrinkage for this options though (betaPrior=FALSE). But the inference and results tables work the same.

Thanks a lot Micheal! I got the idea!

The only problem that I have now is I am using HTSeq-count data this my code:

Code:

design <- ~ gut_microbiota + gut_microbiota:animal.nested + gut_microbiota:tissue
mm <- model.matrix(design, data=sample_table)
(all.zero <- apply(mm, 2, function(x) all(x == 0)))
mm <- as.data.frame(mm[, !all.zero])
dds = DESeqDataSetFromHTSeqCount(sampleTable=sample_table, directory='~/htseq/', design=mm)

But I get an error:

Code:

Error in terms.default(object, data = data) : 
  no terms component nor attribute

Is it because of the way I am fitting the design matrix into the htseq-count data?

**Michael Love** · 03-30-2015, 06:54 AM

So, again, you'll only be able to do this custom model matrix in version 1.8.

packageVersion("DESeq2")

Additionally, you can only provide the matrix to the 'full' argument of DESeq(). In order to construct a DESeqDataSet object, you can use design=~1, because this will be ignored and the custom matrix will be used instead.

**rozitaa** · 03-30-2015, 07:04 AM

Originally posted by Michael Love View Post

So, again, you'll only be able to do this custom model matrix in version 1.8.

packageVersion("DESeq2")

Additionally, you can only provide the matrix to DESeq(). In order to construct a DESeqDataSet object, you can use design=~1, because this will be ignored and the custom matrix will be used instead.

I see!! I cannot wait till April 17th since I have deadline. And it is hard for me to install the developmental version of R! Is there any other solution for my problem?

**Michael Love** · 03-30-2015, 07:35 AM

"Now I would like to see DE genes resulted from comparing of every gut_microbiota status (B, C, D, E) to the baseline (A) on each tissue separately."

Sorry, reading over your column data and question more carefully, I realized that this advice about nesting doesn't apply to your experimental design. The nesting approach works for experimental designs where the comparison of interest is nested within individuals/animal. For example if you wanted to compare tissue differences for each microbiota. However, you want to compare microbiota differences for each tissue. The problem for analysis with the nesting approach is that you cannot control for the animal-specific effects and compare across microbiota, because these variables are confounded. Whereas, animal and tissue are not confounded, which allows the nesting approach to work.

I'd recommend you subset each tissue into 3 DESeqDataSets and run a design of ~ gut_microbiota for each.

**rozitaa** · 03-31-2015, 03:58 AM

Originally posted by Michael Love View Post

I'd recommend you subset each tissue into 3 DESeqDataSets and run a design of ~ gut_microbiota for each.

Thanks a lot Micheal!

**lbragg** · 06-22-2015, 08:28 PM

Originally posted by Michael Love View Post

Also, I had to update that custom line of size factor code for when all rows have a zero. If you were using that line of code to avoid the issue that all rows of your count matrix have one or more zeros, you should get the edited version:

DESeq2 Error in varianceStabilizingTransformation

https://support.bioconductor.org/p/62246/

We're implementing a solution here which was on the list of todos for a while now, but it won't be out for 6 months. Typical RNA-Seq doesn't have this issue, but since phyloseq we are seeing more metagenomics datasets which often have no genes without a zero.

Hi there,

I have a metatranscriptomic dataset where every gene has at least one zero count. I tried the geoMeans function you've supplied via the link, but I keep getting the following error:

Code:

dds = estimateSizeFactors(dds, geoMeans=geoMeans)
Error in if (any(value <= 0)) { : missing value where TRUE/FALSE needed

This is using DESeq2 1.8.1 on R 3.2.

Happy to provide a MRE if you like.

Lauren

**Michael Love** · 06-23-2015, 05:25 AM

what does summary(geoMeans) give you?

**lbragg** · 06-24-2015, 02:50 PM

Originally posted by Michael Love View Post

what does summary(geoMeans) give you?

Code:

summary(geoMeans)
   Min. 1st Qu.  Median    Mean 3rd Qu.    Max. 
  1.414   2.289   2.828   3.731   4.000  28.650

**Michael Love** · 06-25-2015, 06:19 AM

Another test (I suspect there is a column with all zeros):

table(colSums(counts(dds)) == 0)

**lbragg** · 06-25-2015, 02:06 PM

Originally posted by Michael Love View Post

Another test (I suspect there is a column with all zeros):

table(colSums(counts(dds)) == 0)

You're right, there is one sample with all zeros. Thanks.

Topics	Statistics	Last Post
A Closer Look at the Enigmatic Genomes of Oikopleura dioica by seqadmin Started by seqadmin, Yesterday, 06:35 AM	0 responses 14 views 0 likes	Last Post by seqadmin Yesterday, 06:35 AM
Advanced Epigenome Editing Platform Explores Gene Regulation Mechanisms by seqadmin Started by seqadmin, 05-09-2024, 02:46 PM	0 responses 18 views 0 likes	Last Post by seqadmin 05-09-2024, 02:46 PM
Telomere Maintenance by PARP1: A New Perspective in Cancer Research by seqadmin Started by seqadmin, 05-07-2024, 06:57 AM	0 responses 17 views 0 likes	Last Post by seqadmin 05-07-2024, 06:57 AM
Enhanced Neoantigen Detection: Introducing NeoHunter by seqadmin Started by seqadmin, 05-06-2024, 07:17 AM	0 responses 19 views 0 likes	Last Post by seqadmin 05-06-2024, 07:17 AM

Seqanswers Leaderboard Ad

Announcement

Comment

Comment

Comment

Comment

Comment

Comment

Comment

Comment

Comment

Comment

Latest Articles

ad_right_rmr

News