Seqanswers Leaderboard Ad

Collapse

Announcement

Collapse
No announcement yet.
X
 
  • Filter
  • Time
  • Show
Clear All
new posts

  • DEXSeq estimateDispersions error

    Hi everyone,
    I run into a problem when using DEXSeq estimateDispersions command.
    I know the error is because I have only 1 Tumor & Normal RNA-Seq data, which means there's no replicate, but if I still want to deal with the data with DEXSeq how can I solve the problem?

    Samples:
    Code:
                          condition replicate type
    T_DEXSeq.txt     Tumor         1     paired-end
    N_DEXSeq.txt     Normal        1     paired-end
    When I set the nCores=2:
    Code:
    Estimating Cox-Reid exon dispersion estimates using 2 cores. (Progress report: one dot per 100 genes)
    Error in function (classes, fdef, mtable)  :
       unable to find an inherited method for function "fData", for signature "try-error"
    In addition: Warning message:
    In mcLapply(allecs, FUN = funtoapply, mc.cores = mc.cores) :
       all scheduled cores encountered errors in user code
    There's a similar question in http://permalink.gmane.org/gmane.sci...onductor/40321 which occured the same error in another procedure. And Simon said this rarely happened due to high dispersions.

    When I only use one core the error said:
    Code:
    Dispersion estimation. (Progress report: one dot per 100 genes)
    Error in FUN(c("ENSG00000000003", "ENSG00000000419", "ENSG00000000457",  : 
      Underdetermined model; cannot estimate dispersions. Maybe replicates have not been properly specified.
    In addition: Warning messages:
    1: In .local(object, ...) :
      Exons with less than 11 counts will be discarded. For more details read the documentation, parameter minCount
    2: In .local(object, ...) :
      Genes with more than 70 testable exons will be kicked out of the analysis. For more details read the documentation, parameter maxExon
    I'm wondering if I could do some manipulation to avoid the error? Such as change minCount or artificially remove the genes the error mentioned? Thanks!

  • #2
    Seems that in the multicore setting, the correct error message gets overwritten by something less helpful. We need to check this.

    In the single-core call, DEXSeq reports correctly reports: "Underdetermined model; cannot estimate dispersions. Maybe replicates have not been properly specified."

    I guess I don't need to tell you that you cannot extract any biological results from comparing a single normal with a single tumour sample, and that your analysis is hence pointless, anyway.

    Comment


    • #3
      Originally posted by Simon Anders View Post
      Seems that in the multicore setting, the correct error message gets overwritten by something less helpful. We need to check this.

      In the single-core call, DEXSeq reports correctly reports: "Underdetermined model; cannot estimate dispersions. Maybe replicates have not been properly specified."

      I guess I don't need to tell you that you cannot extract any biological results from comparing a single normal with a single tumour sample, and that your analysis is hence pointless, anyway.
      Thank you for replying Simon.
      I know and you have mentioned the meaningless of RNA-Seq data with no replicates for many times. To my standpoint, I knew DESeq has a section of "Working partially without replicates", that's the reason why I guess if there's some way to avoid this error in DEXSeq (maybe that's not very logical..), so your answer is NO, am I right?
      Thank you.

      Comment


      • #4
        Exactly. In principle you can also use DEXSeq in the "blind" mode that we offered for DESeq, but I cannot see any circumstances where this would make sense.

        Comment


        • #5
          Originally posted by Simon Anders View Post
          Exactly. In principle you can also use DEXSeq in the "blind" mode that we offered for DESeq, but I cannot see any circumstances where this would make sense.
          I see. Thank you Simon.

          Comment


          • #6
            Originally posted by Simon Anders View Post
            Exactly. In principle you can also use DEXSeq in the "blind" mode that we offered for DESeq, but I cannot see any circumstances where this would make sense.
            In principle but non in practice right now, am I wrong?
            They are asking me to do so, I suppose that I need to compare DESeq and DEXSeq codes and try to add the blind support there.

            ps.
            Code:
            > counts <- estimateDispersions(counts)
            Dispersion estimation. (Progress report: one dot per 100 genes)
            Error in FUN(c("ENSG00000000003", "ENSG00000000005", "ENSG00000000419",  :
              Underdetermined model; cannot estimate dispersions. Maybe replicates have not been properly specified.
            In addition: Warning messages:
            1: In .local(object, ...) :
              Exons with less than 11 counts will be discarded. For more details read the documentation, parameter minCount
            2: In .local(object, ...) :
              Genes with more than 70 testable exons will be kicked out of the analysis. For more details read the documentation, parameter maxExon
            > counts <- estimateDispersions(counts, nCores=nCores, method="blind")
            Error in .local(object, ...) : unused argument(s) (method = "blind")
            > counts <- estimateDispersions(counts, method="blind")
            Error in .local(object, ...) : unused argument(s) (method = "blind")

            Comment


            • #7
              In principle but non in practice right now, am I wrong? They are asking me to do so, I suppose that I need to compare DESeq and DEXSeq codes and try to add the blind support there.
              Not necessarily, just remove the lines

              Code:
              if( nrow(mm) <= ncol(mm) )
                 stop( "Underdetermined model; cannot estimate dispersions. Maybe replicates have not been properly specified." )
              But of course, those lines have a VERY GOOD reason to be there

              Alejandro

              Comment


              • #8
                I see. Right now I have data without replicates and they are asking me to do a preliminary differential isoforms usage test, so I guess that I have to find a way Maybe I can try with one of the other ton of packages (MISO, etc), which is always an entertaining job...

                I commented out that lines and recompiled DEXSeq but now the next step is having troubles, I guess that reading all the code is the right option now, which is more interesting than trying other packages randomly

                counts <- estimateDispersions(counts, nCores=nCores)
                Estimating Cox-Reid exon dispersion estimates using 3 cores. (Progress report: one dot per 100 genes)
                .........................................................................................................................................................> counts <- fitDispersionFunction(counts)
                Error in glmgam.fit(mm, disps[good], start = coefs) :
                More columns than rows in X
                In addition: Warning message:
                In is.na(rows) : is.na() applied to non-(list or vector) of type 'NULL'
                Error in fitDispersionFunction(counts) :
                Failed to fit the dispersion function
                >

                Comment


                • #9
                  Of course, you cannot estimate dispersion without replicates. So you have to come up with some fantasy value and plug it in, with "fData(ecs)$dispersion <- 0.1".

                  The "0.1" here is just a place holder. I will not venture a guess what an appropriate value might be.

                  Comment


                  • #10
                    Originally posted by Simon Anders View Post
                    Of course, you cannot estimate dispersion without replicates. So you have to come up with some fantasy value and plug it in, with "fData(ecs)$dispersion <- 0.1".

                    The "0.1" here is just a place holder. I will not venture a guess what an appropriate value might be.
                    Yup, thanks. My idea was to check which value is used with the blind method in DESeq (which gave results superimposable with other tools) and give it a try. I'm well aware of your reasonable need for biological replicates, but I'm not the biggest statistical head here and when they ask me to do something which seems reasonable to them I have at least to try

                    Comment


                    • #11
                      Originally posted by areyes View Post
                      Not necessarily, just remove the lines

                      Code:
                      if( nrow(mm) <= ncol(mm) )
                         stop( "Underdetermined model; cannot estimate dispersions. Maybe replicates have not been properly specified." )
                      But of course, those lines have a VERY GOOD reason to be there

                      Alejandro
                      HI,

                      I want to ask how to remove this line. Since where I can find the raw scripts in R installation folders.

                      Thanks..

                      Comment


                      • #12
                        Originally posted by camelbbs View Post
                        HI,

                        I want to ask how to remove this line. Since where I can find the raw scripts in R installation folders.

                        Thanks..
                        See http://www.bioconductor.org/developers
                        "Source code is stored in svn (user: readonly, pass: readonly)."
                        Go to the "R" directory of the DESeq package, and grep for the keyword you are looking for.

                        Best wishes
                        Wolfgang
                        Wolfgang Huber
                        EMBL

                        Comment


                        • #13
                          Originally posted by Wolfgang Huber View Post
                          See http://www.bioconductor.org/developers
                          "Source code is stored in svn (user: readonly, pass: readonly)."
                          Go to the "R" directory of the DESeq package, and grep for the keyword you are looking for.

                          Best wishes
                          Wolfgang
                          Hi Wolfgang,
                          Thanks very much. But while I downloaded the core_function.R and modified it, I get an error to load it:
                          Code:
                          > source("C:\\Users\\cjhe\\Desktop\\dexseq\\core_functions.R")
                          Error in setMethod("estimateSizeFactors", signature(object = "ExonCountSet"),  : 
                            no existing definition for function ‘estimateSizeFactors’

                          Comment


                          • #14
                            Originally posted by Wolfgang Huber View Post
                            See http://www.bioconductor.org/developers
                            "Source code is stored in svn (user: readonly, pass: readonly)."
                            Go to the "R" directory of the DESeq package, and grep for the keyword you are looking for.

                            Best wishes
                            Wolfgang
                            Hi Wolfgang,

                            While I comment the line:
                            Code:
                            if( nrow(mm) <= ncol(mm) )
                               stop( "Underdetermined model; cannot estimate dispersions. Maybe replicates have not been properly specified." )
                            and run estimateDispersions, i got error:

                            Code:
                            Error in match.arg(start.method, c("log(y)", "mean")) : 
                              'arg' must be NULL or a character vector
                            can you help for that? Thanks a lot!

                            Comment


                            • #15
                              Dear camelbbs,

                              have a look on this post:
                              Discussion of next-gen sequencing related bioinformatics: resources, algorithms, open source efforts, etc


                              bests,
                              Alejandro

                              Comment

                              Latest Articles

                              Collapse

                              • seqadmin
                                Essential Discoveries and Tools in Epitranscriptomics
                                by seqadmin




                                The field of epigenetics has traditionally concentrated more on DNA and how changes like methylation and phosphorylation of histones impact gene expression and regulation. However, our increased understanding of RNA modifications and their importance in cellular processes has led to a rise in epitranscriptomics research. “Epitranscriptomics brings together the concepts of epigenetics and gene expression,” explained Adrien Leger, PhD, Principal Research Scientist...
                                04-22-2024, 07:01 AM
                              • seqadmin
                                Current Approaches to Protein Sequencing
                                by seqadmin


                                Proteins are often described as the workhorses of the cell, and identifying their sequences is key to understanding their role in biological processes and disease. Currently, the most common technique used to determine protein sequences is mass spectrometry. While still a valuable tool, mass spectrometry faces several limitations and requires a highly experienced scientist familiar with the equipment to operate it. Additionally, other proteomic methods, like affinity assays, are constrained...
                                04-04-2024, 04:25 PM

                              ad_right_rmr

                              Collapse

                              News

                              Collapse

                              Topics Statistics Last Post
                              Started by seqadmin, Yesterday, 08:47 AM
                              0 responses
                              16 views
                              0 likes
                              Last Post seqadmin  
                              Started by seqadmin, 04-11-2024, 12:08 PM
                              0 responses
                              60 views
                              0 likes
                              Last Post seqadmin  
                              Started by seqadmin, 04-10-2024, 10:19 PM
                              0 responses
                              60 views
                              0 likes
                              Last Post seqadmin  
                              Started by seqadmin, 04-10-2024, 09:21 AM
                              0 responses
                              54 views
                              0 likes
                              Last Post seqadmin  
                              Working...
                              X