PromptPO shows LLMs can act as black-box policy optimizers for sequential RL when leveraging prior knowledge, matching baselines in exploration and robotics but underperforming in MuJoCo.
Reinforcement learning for optimization of covid-19 mitigation policies
4 Pith papers cite this work. Polarity classification is still indexing.
verdicts
UNVERDICTED 4representative citing papers
Hierarchical RL with a global cost controller and local marginal-value policies outperforms RMAB and heuristic baselines by 20-30% in simulated multi-cluster SARS-CoV-2 control.
MARL framework for jurisdiction-specific HIV intervention allocation accounting for cross-jurisdictional interactions outperforms single-agent RL in CA/FL simulations under fixed budgets.
A hierarchical RL simulation of agent behaviors and uncertainty-aware policy optimization shows masking and vaccination reduce epidemic peaks and duration.
citing papers explorer
-
A Multi-Agent Reinforcement Learning Framework for Public Health Decision Analysis
MARL framework for jurisdiction-specific HIV intervention allocation accounting for cross-jurisdictional interactions outperforms single-agent RL in CA/FL simulations under fixed budgets.
-
Neetyabhas: A Framework for Uncertainty-Aware Public Policy Optimization in Rational Agent-Based Models
A hierarchical RL simulation of agent behaviors and uncertainty-aware policy optimization shows masking and vaccination reduce epidemic peaks and duration.