Seqanswers Leaderboard Ad

**Simon Anders** · 01-18-2013, 12:27 AM

Can you post the plot of dispersion versus mean? Use, e.g.,

Code:

plot( 
   rowMeans( counts( ecs, normalized=TRUE ) ), 
   fData(ecs)$dispBeforeSharing, 
   log="xy", pch=19, cex=.3, col="#00000040" )

**vplagnol** · 01-18-2013, 02:19 AM

Thank you for your quick reply Simon.

So if I don't use my "fudge", the coefficients are:
(Intercept) I(1/means[good])
0.0000000 0.9973345

I attach my mean vs dispersion plot, which hopefully will make sense to you.
Note that this is only for a small subset of 300 exons which I use for debugging but the same exact thing happens for the full genome-wide set.

Attached Files

DEXSeq-MeanVsDispPoints.pdf (10.2 KB, 67 views)

**Simon Anders** · 01-18-2013, 02:31 AM

This plot seems perfectly fine to me. Is the red line now drawn with the original or the fudged coefficients? The intercept (y value of the red line where x=exp(0)=1) looks positive and correct. In any case, it seems to trace the points well, so it should be correct to continue the calculation with this fit.

By the way, you have called the estimateLog2FoldChanges function, haven't you? Otherwise, it would be no wonder that you don't have fold change estimates.

**vplagnol** · 01-18-2013, 02:49 AM

Argh sorry... this plot was in fact using my fudge. Just tired I suppose. Now here is the problematic plot, I think you will see the issue.

And here are the messed up coefficients for the fit:
> DexSeqExons.loc@dispFitCoefs
(Intercept) I(1/means[good])
-0.0001751386 0.9973344545

Basically my understanding is that the first iteration results in a negative intercept. If I set it back to a positive value and let the algorithm iterate, I get the reasonable graph you saw above. But as it stands the fitting gives up right away as soon as such a negative intercept occurs.

Attached Files

DEXSeq-MeanVsDispPoints.pdf (10.2 KB, 57 views)
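To make the effect of the sign of the intercept concrete, here is a minimal sketch that evaluates the fitted curve, dispersion = intercept + slope / mean, for the two coefficient pairs quoted in this thread. The helper `fittedDisp` and the grid of mean values are made up for illustration and are not part of DEXSeq. With the negative intercept, the curve drops below zero once the mean exceeds -slope / intercept, which for these numbers is a mean of roughly 5,700.

```r
# Fitted dispersion under the parametric form discussed in this thread:
#   dispersion = intercept + slope / mean
# (fittedDisp is a hypothetical helper, not a DEXSeq function)
fittedDisp <- function(coefs, means) coefs[1] + coefs[2] / means

# Coefficient pairs as quoted in the posts above
coefsZero     <- c(0.0000000, 0.9973345)
coefsNegative <- c(-0.0001751386, 0.9973344545)

# Hypothetical normalized mean counts, from low to high expression
means <- c(1, 10, 100, 1000, 10000, 100000)

# Side-by-side view of both curves on the same grid
comparison <- data.frame(
   mean          = means,
   zeroIntercept = fittedDisp(coefsZero, means),
   negIntercept  = fittedDisp(coefsNegative, means) )
print(comparison)

# Mean at which the negative-intercept curve crosses zero
crossover <- -coefsNegative[2] / coefsNegative[1]
print(crossover)
```

The difference between the two curves is negligible for weakly expressed exons and only shows up at high means, where the slope term becomes smaller than the intercept.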

**Simon Anders** · 01-18-2013, 03:33 AM

It's annoying that these negative intercepts happen, and we need to figure out how to get rid of them. Your fudge, however, seems to do the trick for your data, because with it, the fit looks reasonable.

After you have modified the dispersion coefficients, use the following code to recalculate the dispersion values from them:

Code:

fData(ecs)$dispFitted <- ecs@dispFitCoefs[1] + 
   ecs@dispFitCoefs[2] / colMeans( t(counts(ecs)) / sizeFactors(ecs) )
fData(ecs)$dispersion <- pmin( pmax( fData(ecs)$dispBeforeSharing, 
   fData(ecs)$dispFitted, na.rm = TRUE), 1e+08)

(These are the last two lines from 'fitDispersionFunction'.)
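For readers who want to see what these two lines compute without loading a DEXSeq object, here is a small sketch on plain R vectors and matrices. The toy count matrix, size factors, per-exon dispersions and coefficients are all invented for illustration; only the arithmetic mirrors the code above.

```r
# Toy stand-ins for counts(ecs), sizeFactors(ecs) and
# fData(ecs)$dispBeforeSharing (all values are made up)
counts <- matrix(c( 10,  20,  15,
                   200, 180, 260,
                     0,   1,   2 ),
                 nrow = 3, byrow = TRUE,
                 dimnames = list(c("E001", "E002", "E003"),
                                 c("s1", "s2", "s3")))
sizeFactors <- c(s1 = 0.8, s2 = 1.0, s3 = 1.3)
dispBeforeSharing <- c(E001 = 0.30, E002 = 0.002, E003 = NA)

# Hypothetical coefficients after manually resetting the intercept
coefs <- c(0.001, 0.9973345)

# Per-exon mean of the size-factor-normalized counts:
# t(counts) has one row per sample, so dividing by sizeFactors
# scales each sample by its own factor before averaging
means <- colMeans(t(counts) / sizeFactors)

# Fitted dispersion from the mean-dispersion curve
dispFitted <- coefs[1] + coefs[2] / means

# Final dispersion: the larger of the per-exon and fitted values,
# falling back to the other one if either is NA, capped at 1e+08
dispersion <- pmin(pmax(dispBeforeSharing, dispFitted, na.rm = TRUE), 1e+08)
print(cbind(means, dispBeforeSharing, dispFitted, dispersion))
```

In this toy case the first exon keeps its own estimate because it is larger than the fitted value, the second is raised to the fitted value, and the third, whose per-exon estimate is NA, receives the fitted value because of `na.rm = TRUE`.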

**vplagnol** · 01-18-2013, 04:45 AM

OK thanks, this is what I needed to hear.

For what it's worth, my solution is not very elegant, but because I remove the "break" that occurs after the negative coefficients, the fit continues with more iterations, and I do not think I need these extra two lines. In my situation the first iteration gives a negative intercept, but resetting the starting coefficients at that point lets the fit converge toward proper parameter values. This solves it for that dataset at least.

See below my modification to your fitDispersion function, again no claim to have a fix, just a rough idea:

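if (coefs[1] < 0) {
   # reset the intercept to a small positive value instead of giving up
   coefs[1] <- 0.001
   warning("Negative intercept value in the dispersion function, it will be set to 0.001. Check fit diagnostics plot section from the vignette.")
   # break removed so that the fit keeps iterating
   #break
}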

plotDEXseq failure- all log2fold values set to NA or NaN
