Bellman calibration supplies a new reliability criterion and post-hoc recalibration method for value functions in offline RL, with finite-sample guarantees at one-dimensional nonparametric rates that avoid Bellman completeness and realizability assumptions.
Bellman conformal inference: Calibrating prediction intervals for time series.arXiv preprint arXiv:2402.05203
3 Pith papers cite this work. Polarity classification is still indexing.
verdicts
UNVERDICTED 3representative citing papers
A parameter-free algorithm for group-conditional online conformal prediction that achieves optimal coverage guarantees without learning-rate tuning.
Proposes Cost-Aware Adaptive Conformal Inference framework providing dual statistical guarantees on long-run violation frequency and cumulative violation cost for runtime assurance in dynamic environments.
citing papers explorer
-
Bellman Calibration for $V$-Learning in Offline Reinforcement Learning
Bellman calibration supplies a new reliability criterion and post-hoc recalibration method for value functions in offline RL, with finite-sample guarantees at one-dimensional nonparametric rates that avoid Bellman completeness and realizability assumptions.
-
Parameter-Free and Group Conditional Online Conformal Prediction
A parameter-free algorithm for group-conditional online conformal prediction that achieves optimal coverage guarantees without learning-rate tuning.
-
Cost-Aware Adaptive Conformal Inference for Runtime Assurance in Dynamic Environments
Proposes Cost-Aware Adaptive Conformal Inference framework providing dual statistical guarantees on long-run violation frequency and cumulative violation cost for runtime assurance in dynamic environments.