Variational Expectation Maximization scales NLME model fitting to over 15,000 population parameters using flexible variational families and reverse-mode automatic differentiation.
A machine-rendered reading of the paper's core claim, the
machinery that carries it, and where it could break.
Nonlinear mixed effects models are used to study data where each person or subject has slightly different parameters drawn from a larger population, such as how individuals respond differently to the same drug. Fitting these models requires integrating over all possible individual variations, which becomes very slow when there are thousands of parameters. The paper tests whether variational expectation maximization, an approximation technique that replaces the hard integral with a simpler optimized distribution, can make this feasible. They show it works on a standard warfarin model and demonstrate speed on a much larger DeepNLME Friberg model with 15,410 population parameters and 16 random effects. The method uses automatic differentiation to compute updates efficiently without manual derivatives.
Core claim
VEM can efficiently maximize the marginal likelihood, scaling to NLME models with over 15,000 population parameters.
Load-bearing premise
The chosen variational family provides a sufficiently accurate approximation to the true posterior for the large-scale NLME models tested, and the Pumas implementation correctly recovers known results on the warfarin model.
read the original abstract
Nonlinear Mixed Effects models (NLME) models are widely used in pharmacometrics and related fields to analyze hierarchical and longitudinal data. However, as the number of parameters and random effects increases, traditional methods for maximizing the marginal likelihood become computationally expensive. This paper explores the Variational Expectation Maximization (VEM) algorithm, a scalable alternative for fitting NLME models. Originally introduced in the context of probabilistic graphical models and later popularized through variational autoencoders, VEM has not been extensively applied to NLME modeling. By leveraging flexible variational families and reverse-mode automatic differentiation, VEM can efficiently maximize the marginal likelihood, scaling to NLME models with over 15,000 population parameters. This work provides a detailed description of VEM, compares it to other NLME fitting algorithms, and highlights its scalability through computational experiments. Using the Pumas statistical software, we fit two test models: 1) a standard warfarin model, and 2) a DeepNLME Friberg model with 15,410 population parameters and 16 random effects. The warfarin model was fitted to completion to demonstrate the correctness of VEM, while the DeepNLME Friberg model was fitted for a limited number of iterations to measure the time per iteration and demonstrate VEM's scalability.
Editorial analysis
A structured set of objections, weighed in public.
Desk editor's note, referee report, simulated authors' rebuttal, and a
circularity audit. Tearing a paper down is the easy half of reading it; the
pith above is the substance, this is the friction.
Review based on abstract only; no explicit free parameters, axioms, or invented entities are detailed in the provided text.
axioms (1)
domain assumptionA flexible variational family can approximate the intractable posterior in NLME models sufficiently well for optimization purposes Implicit in the use of VEM for marginal likelihood maximization
pith-pipeline@v0.9.0 ·
5535 in / 1176 out tokens ·
59866 ms ·
2026-05-07T12:20:26.567783+00:00
· methodology
discussion (0)
Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.