Seqanswers Leaderboard Ad

Collapse

Announcement

Collapse
No announcement yet.
X
 
  • Filter
  • Time
  • Show
Clear All
new posts

  • DEXSeq estimateDispersions error

    Hi everyone,
    I run into a problem when using DEXSeq estimateDispersions command.
    I know the error is because I have only 1 Tumor & Normal RNA-Seq data, which means there's no replicate, but if I still want to deal with the data with DEXSeq how can I solve the problem?

    Samples:
    Code:
                          condition replicate type
    T_DEXSeq.txt     Tumor         1     paired-end
    N_DEXSeq.txt     Normal        1     paired-end
    When I set the nCores=2:
    Code:
    Estimating Cox-Reid exon dispersion estimates using 2 cores. (Progress report: one dot per 100 genes)
    Error in function (classes, fdef, mtable)  :
       unable to find an inherited method for function "fData", for signature "try-error"
    In addition: Warning message:
    In mcLapply(allecs, FUN = funtoapply, mc.cores = mc.cores) :
       all scheduled cores encountered errors in user code
    There's a similar question in http://permalink.gmane.org/gmane.sci...onductor/40321 which occured the same error in another procedure. And Simon said this rarely happened due to high dispersions.

    When I only use one core the error said:
    Code:
    Dispersion estimation. (Progress report: one dot per 100 genes)
    Error in FUN(c("ENSG00000000003", "ENSG00000000419", "ENSG00000000457",  : 
      Underdetermined model; cannot estimate dispersions. Maybe replicates have not been properly specified.
    In addition: Warning messages:
    1: In .local(object, ...) :
      Exons with less than 11 counts will be discarded. For more details read the documentation, parameter minCount
    2: In .local(object, ...) :
      Genes with more than 70 testable exons will be kicked out of the analysis. For more details read the documentation, parameter maxExon
    I'm wondering if I could do some manipulation to avoid the error? Such as change minCount or artificially remove the genes the error mentioned? Thanks!

  • #2
    Seems that in the multicore setting, the correct error message gets overwritten by something less helpful. We need to check this.

    In the single-core call, DEXSeq reports correctly reports: "Underdetermined model; cannot estimate dispersions. Maybe replicates have not been properly specified."

    I guess I don't need to tell you that you cannot extract any biological results from comparing a single normal with a single tumour sample, and that your analysis is hence pointless, anyway.

    Comment


    • #3
      Originally posted by Simon Anders View Post
      Seems that in the multicore setting, the correct error message gets overwritten by something less helpful. We need to check this.

      In the single-core call, DEXSeq reports correctly reports: "Underdetermined model; cannot estimate dispersions. Maybe replicates have not been properly specified."

      I guess I don't need to tell you that you cannot extract any biological results from comparing a single normal with a single tumour sample, and that your analysis is hence pointless, anyway.
      Thank you for replying Simon.
      I know and you have mentioned the meaningless of RNA-Seq data with no replicates for many times. To my standpoint, I knew DESeq has a section of "Working partially without replicates", that's the reason why I guess if there's some way to avoid this error in DEXSeq (maybe that's not very logical..), so your answer is NO, am I right?
      Thank you.

      Comment


      • #4
        Exactly. In principle you can also use DEXSeq in the "blind" mode that we offered for DESeq, but I cannot see any circumstances where this would make sense.

        Comment


        • #5
          Originally posted by Simon Anders View Post
          Exactly. In principle you can also use DEXSeq in the "blind" mode that we offered for DESeq, but I cannot see any circumstances where this would make sense.
          I see. Thank you Simon.

          Comment


          • #6
            Originally posted by Simon Anders View Post
            Exactly. In principle you can also use DEXSeq in the "blind" mode that we offered for DESeq, but I cannot see any circumstances where this would make sense.
            In principle but non in practice right now, am I wrong?
            They are asking me to do so, I suppose that I need to compare DESeq and DEXSeq codes and try to add the blind support there.

            ps.
            Code:
            > counts <- estimateDispersions(counts)
            Dispersion estimation. (Progress report: one dot per 100 genes)
            Error in FUN(c("ENSG00000000003", "ENSG00000000005", "ENSG00000000419",  :
              Underdetermined model; cannot estimate dispersions. Maybe replicates have not been properly specified.
            In addition: Warning messages:
            1: In .local(object, ...) :
              Exons with less than 11 counts will be discarded. For more details read the documentation, parameter minCount
            2: In .local(object, ...) :
              Genes with more than 70 testable exons will be kicked out of the analysis. For more details read the documentation, parameter maxExon
            > counts <- estimateDispersions(counts, nCores=nCores, method="blind")
            Error in .local(object, ...) : unused argument(s) (method = "blind")
            > counts <- estimateDispersions(counts, method="blind")
            Error in .local(object, ...) : unused argument(s) (method = "blind")

            Comment


            • #7
              In principle but non in practice right now, am I wrong? They are asking me to do so, I suppose that I need to compare DESeq and DEXSeq codes and try to add the blind support there.
              Not necessarily, just remove the lines

              Code:
              if( nrow(mm) <= ncol(mm) )
                 stop( "Underdetermined model; cannot estimate dispersions. Maybe replicates have not been properly specified." )
              But of course, those lines have a VERY GOOD reason to be there

              Alejandro

              Comment


              • #8
                I see. Right now I have data without replicates and they are asking me to do a preliminary differential isoforms usage test, so I guess that I have to find a way Maybe I can try with one of the other ton of packages (MISO, etc), which is always an entertaining job...

                I commented out that lines and recompiled DEXSeq but now the next step is having troubles, I guess that reading all the code is the right option now, which is more interesting than trying other packages randomly

                counts <- estimateDispersions(counts, nCores=nCores)
                Estimating Cox-Reid exon dispersion estimates using 3 cores. (Progress report: one dot per 100 genes)
                .........................................................................................................................................................> counts <- fitDispersionFunction(counts)
                Error in glmgam.fit(mm, disps[good], start = coefs) :
                More columns than rows in X
                In addition: Warning message:
                In is.na(rows) : is.na() applied to non-(list or vector) of type 'NULL'
                Error in fitDispersionFunction(counts) :
                Failed to fit the dispersion function
                >

                Comment


                • #9
                  Of course, you cannot estimate dispersion without replicates. So you have to come up with some fantasy value and plug it in, with "fData(ecs)$dispersion <- 0.1".

                  The "0.1" here is just a place holder. I will not venture a guess what an appropriate value might be.

                  Comment


                  • #10
                    Originally posted by Simon Anders View Post
                    Of course, you cannot estimate dispersion without replicates. So you have to come up with some fantasy value and plug it in, with "fData(ecs)$dispersion <- 0.1".

                    The "0.1" here is just a place holder. I will not venture a guess what an appropriate value might be.
                    Yup, thanks. My idea was to check which value is used with the blind method in DESeq (which gave results superimposable with other tools) and give it a try. I'm well aware of your reasonable need for biological replicates, but I'm not the biggest statistical head here and when they ask me to do something which seems reasonable to them I have at least to try

                    Comment


                    • #11
                      Originally posted by areyes View Post
                      Not necessarily, just remove the lines

                      Code:
                      if( nrow(mm) <= ncol(mm) )
                         stop( "Underdetermined model; cannot estimate dispersions. Maybe replicates have not been properly specified." )
                      But of course, those lines have a VERY GOOD reason to be there

                      Alejandro
                      HI,

                      I want to ask how to remove this line. Since where I can find the raw scripts in R installation folders.

                      Thanks..

                      Comment


                      • #12
                        Originally posted by camelbbs View Post
                        HI,

                        I want to ask how to remove this line. Since where I can find the raw scripts in R installation folders.

                        Thanks..
                        See http://www.bioconductor.org/developers
                        "Source code is stored in svn (user: readonly, pass: readonly)."
                        Go to the "R" directory of the DESeq package, and grep for the keyword you are looking for.

                        Best wishes
                        Wolfgang
                        Wolfgang Huber
                        EMBL

                        Comment


                        • #13
                          Originally posted by Wolfgang Huber View Post
                          See http://www.bioconductor.org/developers
                          "Source code is stored in svn (user: readonly, pass: readonly)."
                          Go to the "R" directory of the DESeq package, and grep for the keyword you are looking for.

                          Best wishes
                          Wolfgang
                          Hi Wolfgang,
                          Thanks very much. But while I downloaded the core_function.R and modified it, I get an error to load it:
                          Code:
                          > source("C:\\Users\\cjhe\\Desktop\\dexseq\\core_functions.R")
                          Error in setMethod("estimateSizeFactors", signature(object = "ExonCountSet"),  : 
                            no existing definition for function ‘estimateSizeFactors’

                          Comment


                          • #14
                            Originally posted by Wolfgang Huber View Post
                            See http://www.bioconductor.org/developers
                            "Source code is stored in svn (user: readonly, pass: readonly)."
                            Go to the "R" directory of the DESeq package, and grep for the keyword you are looking for.

                            Best wishes
                            Wolfgang
                            Hi Wolfgang,

                            While I comment the line:
                            Code:
                            if( nrow(mm) <= ncol(mm) )
                               stop( "Underdetermined model; cannot estimate dispersions. Maybe replicates have not been properly specified." )
                            and run estimateDispersions, i got error:

                            Code:
                            Error in match.arg(start.method, c("log(y)", "mean")) : 
                              'arg' must be NULL or a character vector
                            can you help for that? Thanks a lot!

                            Comment


                            • #15
                              Dear camelbbs,

                              have a look on this post:
                              Discussion of next-gen sequencing related bioinformatics: resources, algorithms, open source efforts, etc


                              bests,
                              Alejandro

                              Comment

                              Latest Articles

                              Collapse

                              • seqadmin
                                Advancing Precision Medicine for Rare Diseases in Children
                                by seqadmin




                                Many organizations study rare diseases, but few have a mission as impactful as Rady Children’s Institute for Genomic Medicine (RCIGM). “We are all about changing outcomes for children,” explained Dr. Stephen Kingsmore, President and CEO of the group. The institute’s initial goal was to provide rapid diagnoses for critically ill children and shorten their diagnostic odyssey, a term used to describe the long and arduous process it takes patients to obtain an accurate...
                                12-16-2024, 07:57 AM
                              • seqadmin
                                Recent Advances in Sequencing Technologies
                                by seqadmin



                                Innovations in next-generation sequencing technologies and techniques are driving more precise and comprehensive exploration of complex biological systems. Current advancements include improved accessibility for long-read sequencing and significant progress in single-cell and 3D genomics. This article explores some of the most impactful developments in the field over the past year.

                                Long-Read Sequencing
                                Long-read sequencing has seen remarkable advancements,...
                                12-02-2024, 01:49 PM

                              ad_right_rmr

                              Collapse

                              News

                              Collapse

                              Topics Statistics Last Post
                              Started by seqadmin, 12-17-2024, 10:28 AM
                              0 responses
                              26 views
                              0 likes
                              Last Post seqadmin  
                              Started by seqadmin, 12-13-2024, 08:24 AM
                              0 responses
                              43 views
                              0 likes
                              Last Post seqadmin  
                              Started by seqadmin, 12-12-2024, 07:41 AM
                              0 responses
                              29 views
                              0 likes
                              Last Post seqadmin  
                              Started by seqadmin, 12-11-2024, 07:45 AM
                              0 responses
                              42 views
                              0 likes
                              Last Post seqadmin  
                              Working...
                              X