vsOED uses a variational one-point reward and RL policy optimization to provide a lower bound on expected information gain for sequential experimental design, supporting nuisance parameters, implicit likelihoods, and multiple design goals.
Title resolution pending
2 Pith papers cite this work. Polarity classification is still indexing.
verdicts
UNVERDICTED 2representative citing papers
A constrained density-ratio network approximates the Radon-Nikodym derivative and feeds an anytime PAC-Bayes certificate for learning under covariate shift, validated via synthetic patch tests and real-data deployment.
citing papers explorer
-
Variational Sequential Optimal Experimental Design using Reinforcement Learning
vsOED uses a variational one-point reward and RL policy optimization to provide a lower bound on expected information gain for sequential experimental design, supporting nuisance parameters, implicit likelihoods, and multiple design goals.
-
Anytime PAC-Bayes for Constrained Density-Ratio Networks under Covariate Shift
A constrained density-ratio network approximates the Radon-Nikodym derivative and feeds an anytime PAC-Bayes certificate for learning under covariate shift, validated via synthetic patch tests and real-data deployment.