Seqanswers Leaderboard Ad

Collapse

Announcement

Collapse
No announcement yet.
X
 
  • Filter
  • Time
  • Show
Clear All
new posts

  • DESeq2: rows did not converge in beta

    I'm trying to run a rather complicated model in DESeq2 with 16S microbiome data. More specifically, the model is "~ confounder + diseaseStatus:subject + timepoint*diseaseStatus" (a two-timepoint case-control comparison, with the same subjects at both timepoints; all variables are factors, except for the confounder, which is numeric). Probably because of how complicated this is, I keep running into the "rows did not converge in beta, labelled in mcols(object)$betaConv. Use larger maxit argument with nbinomWaldTest" issue. Now, this wouldn't be such a big problem otherwise, but many of the rows that don't converge in beta represent microbial taxa that I am very much interested in, and would like to have results for.

    First of all, the output still includes a full set of results, complete with p-values, for the rows that were labeled as non-converging. Are these good for anything, or should I just ignore them completely? Some old discussions I found on this topic suggest deleting all the non-converging rows from the output. The DESeq2 documentation mentions a useOptim parameter, "whether to use the native optim function on rows which do not converge within maxit", for nbinomWaldTest, which is by default TRUE, and I assume is relevant to what's going on here, but I don't really understand what this means.

    Secondly, is there any way to get around this issue? I've looked at everything google/earlier conversations suggest as solutions for this, which mostly comes down to trimming the data and increasing the maxit value for nbinomWaldTest. I've tried both, and neither trimming the data aggressively nor increasing the maxit several orders of magnitude (from the default 100 all the way to 1 000 000) help lower the number of non-converging rows. I'd be extremely thankful for any additional ideas that might help.

    (I guess it might come down to "run a simpler model". I'm just fond of running it as one comparison using all the samples at once, instead of subsetting it by timepoint or some such.)

Latest Articles

Collapse

  • seqadmin
    Current Approaches to Protein Sequencing
    by seqadmin


    Proteins are often described as the workhorses of the cell, and identifying their sequences is key to understanding their role in biological processes and disease. Currently, the most common technique used to determine protein sequences is mass spectrometry. While still a valuable tool, mass spectrometry faces several limitations and requires a highly experienced scientist familiar with the equipment to operate it. Additionally, other proteomic methods, like affinity assays, are constrained...
    04-04-2024, 04:25 PM
  • seqadmin
    Strategies for Sequencing Challenging Samples
    by seqadmin


    Despite advancements in sequencing platforms and related sample preparation technologies, certain sample types continue to present significant challenges that can compromise sequencing results. Pedro Echave, Senior Manager of the Global Business Segment at Revvity, explained that the success of a sequencing experiment ultimately depends on the amount and integrity of the nucleic acid template (RNA or DNA) obtained from a sample. “The better the quality of the nucleic acid isolated...
    03-22-2024, 06:39 AM

ad_right_rmr

Collapse

News

Collapse

Topics Statistics Last Post
Started by seqadmin, 04-11-2024, 12:08 PM
0 responses
18 views
0 likes
Last Post seqadmin  
Started by seqadmin, 04-10-2024, 10:19 PM
0 responses
22 views
0 likes
Last Post seqadmin  
Started by seqadmin, 04-10-2024, 09:21 AM
0 responses
17 views
0 likes
Last Post seqadmin  
Started by seqadmin, 04-04-2024, 09:00 AM
0 responses
48 views
0 likes
Last Post seqadmin  
Working...
X