Seqanswers Leaderboard Ad

Collapse

Announcement

Collapse
No announcement yet.
X
 
  • Filter
  • Time
  • Show
Clear All
new posts

  • Error Message in nbinomLRT in DESeq2

    Hello all. I am working on some code for my lab and I have run into an error message I can seem to get around. My goal is to test each variable in the model given all other variables are present. Here is the code and the error that are tripping me up.

    DESrun=nbinomLRT(DESrun, useOptim=FALSE, maxit = 500 reduced=as.formula(paste("~",paste(Vars[-i],collapse="+"))))

    Error in optim(betaRow, objectiveFn, method = "L-BFGS-B", lower = -large, :
    L-BFGS-B needs finite values of 'fn'


    Vars contain my variables of interest, for example Vars = c("Age","Gender","Disorder"). And DESrun has my full model, dispersion estimates, size factor estimates, and data. I am using DESeq2 version 1.0.18.

    If I am still getting this error with the useOptim set to false and maxit set to 500 anyone have a suggestion?

    Thanks,
    Todd

  • #2
    Hi,

    I have a pretty similar problem :

    Error in optim(betaRow, objectiveFn, method = "L-BFGS-B", lower = -large, :
    non-finite finite-difference value [3]


    What is weird is that i'm pretty sure it worked with an older version of DESeq2 as i'm re-running an old script.

    Did you find a solution ?

    Comment


    • #3
      Can you post the relevant portion of your script (including the design data.frame)? I suspect that something is off in your fit function.

      Comment


      • #4
        No solution yet. But, I have been in contact with Mike Love and he is tracking it down. He sent me the following to see which rows are causing an issue.


        if you do:

        debug(DESeq2:::fitNbinomGLMs)

        this will show which rows are crashing.

        nbinomLRT first runs fitNbinomGLMs on the full formula which is not
        throwing the error. So you can hit enter through the function once.
        The second time it is fitting the reduced formula. On the second time,
        after this code chunk, you can identify the rows causing the problem:

        # switch based on whether we should also use optim
        # on rows which did not converge
        if (useOptim) {
        rowsForOptim <- which(!betaConv | !rowStable | !rowVarPositive)
        } else {
        rowsForOptim <- which(!rowStable | !rowVarPositive)
        }

        The rowsForOptim is the subset of rows which are causing problems.

        My current plan is to use this to find the rows causing a problem and remove them, then rerun DESeq2.

        Comment


        • #5
          I now am committing DESeq2 version 1.0.19 to Bioc which uses Nelder-Mead for this back-up optimization rather than L-BFGS-B. This appears to avoid this halting error.

          Furthermore, for Todd's data, it is necessary, for now, to also standardize the numeric predictors, e.g.:

          # scale the numeric predictors
          for (var in c("age","weight")) {
          colData(dds)[[var]] <- as.numeric(scale(colData(dds)[[var]]))
          }

          Then in the end, the log2 fold changes and lfcSE can be multiplied by sd(age) to obtain what would have been the original log2 fold change expected per year for the age variable. (Or you can leave them as they are and they are interpretable as log2 fold changes expected per standard deviation)

          I am working on improving the fitting algorithm in the devel branch, so that this manual scaling will not be necessary.

          Comment


          • #6
            Hi,

            I was using a continuous variable that was the number of bacteria as a covariate.
            This value scales from ~ 1e4 to 1e7
            If i use this scale, i have the error message : "Error in optim(betaRow, objectiveFn, method = "L-BFGS-B", lower = -large, : non-finite finite-difference value [3]"
            If i instead divide my values by 1e6 (it's now millions of bacteria), it works.

            Comment


            • #7
              Are you sure that you don't want to rather regress on the logarithm of your predictor, given its huge dynamic range?

              Comment


              • #8
                Originally posted by Simon Anders View Post
                Are you sure that you don't want to rather regress on the logarithm of your predictor, given its huge dynamic range?
                That's a good idea, thanks!
                I'll check that !

                Comment


                • #9
                  I have implemented a change to the fitting algorithm in DESeq2 version >= 1.1.32 which should eliminate the need to standardize predictors as described in my post above.

                  Comment


                  • #10
                    Hi Michael,

                    Here is a part of my code :

                    Code:
                    colData <- data.frame(
                    	AFcomp = sub_individuals$AFcomp,
                    	bacteria_count = sub_individuals$Contact,
                    	batch = as.factor(paste0("F", sub_individuals$Flowcell)))
                    
                    dds <- DESeqDataSetFromMatrix(
                    	countData = sub_reads,
                    	colData = colData,
                    	design = ~ AFcomp + bacteria_count + batch)
                    
                    dds <- estimateSizeFactors(dds)
                    dds <- estimateDispersions(dds, maxit = 500, quiet = TRUE)
                    dds <- nbinomLRT(dds, full = design(dds), maxit = 500, reduced = ~ bacteria_count + batch)
                    Where :
                    • AFcomp is a continuous variable scaling from 0 to 1.
                    • bacteria_count is a continuous variable from 1e6 to 8e6.
                    • batch is a discrete variable (batch name).


                    if i use the bacteria_count like that, i get this error :

                    Code:
                    Error in solve.default(xtwx + ridge) :
                      system is computationally singular: reciprocal condition number = 9.67189e-20
                    If i divide the bacteria_count by 1e6 (now looking at millions of bacteria), it works.

                    I'm using the 1.1.33 version from the dev branch of bioconductor.

                    Do you have an idea?

                    Thanks!
                    Last edited by Yohann; 09-04-2013, 10:18 AM.

                    Comment


                    • #11
                      hi Yohann,

                      I see the problem and will look into it.

                      Comment


                      • #12
                        hi Yohann,

                        The problem was coming from some code which goes over rows which did not converge in the standard GLM fitting steps using R's optimization function. So I had not updated this code to account for very large columns in the design matrix. I have now updated this bit of code in version 1.1.35 and added a unit test to confirm that the coefficients and standard errors are identical with or without this extra optimization loop.

                        If you could check if this solves your error that would be great.

                        Comment


                        • #13
                          hi Michael, thanks for your answer.

                          It looks like i can only retrieve the 1.1.34 version on http://bioconductor.org/packages/dev...ml/DESeq2.html.

                          Is there an other way to get the 1.1.35 ?

                          Thanks,

                          Comment


                          • #14
                            It should show up on the Bioc site around this time tomorrow.

                            Comment

                            Latest Articles

                            Collapse

                            • seqadmin
                              Strategies for Sequencing Challenging Samples
                              by seqadmin


                              Despite advancements in sequencing platforms and related sample preparation technologies, certain sample types continue to present significant challenges that can compromise sequencing results. Pedro Echave, Senior Manager of the Global Business Segment at Revvity, explained that the success of a sequencing experiment ultimately depends on the amount and integrity of the nucleic acid template (RNA or DNA) obtained from a sample. “The better the quality of the nucleic acid isolated...
                              03-22-2024, 06:39 AM
                            • seqadmin
                              Techniques and Challenges in Conservation Genomics
                              by seqadmin



                              The field of conservation genomics centers on applying genomics technologies in support of conservation efforts and the preservation of biodiversity. This article features interviews with two researchers who showcase their innovative work and highlight the current state and future of conservation genomics.

                              Avian Conservation
                              Matthew DeSaix, a recent doctoral graduate from Kristen Ruegg’s lab at The University of Colorado, shared that most of his research...
                              03-08-2024, 10:41 AM

                            ad_right_rmr

                            Collapse

                            News

                            Collapse

                            Topics Statistics Last Post
                            Started by seqadmin, Yesterday, 06:37 PM
                            0 responses
                            8 views
                            0 likes
                            Last Post seqadmin  
                            Started by seqadmin, Yesterday, 06:07 PM
                            0 responses
                            8 views
                            0 likes
                            Last Post seqadmin  
                            Started by seqadmin, 03-22-2024, 10:03 AM
                            0 responses
                            49 views
                            0 likes
                            Last Post seqadmin  
                            Started by seqadmin, 03-21-2024, 07:32 AM
                            0 responses
                            67 views
                            0 likes
                            Last Post seqadmin  
                            Working...
                            X