A support-aware offline decision framework for reserve-policy selection that outputs certified policies and shortlists instead of rankings, with a finite-catalog guarantee preserving the best supported policy.
Miroslav Dudík, John Langford, and Lihong Li
2 Pith papers cite this work. Polarity classification is still indexing.
2
Pith papers citing it
years
2026 2verdicts
UNVERDICTED 2representative citing papers
A support-aware DSS integrates replay, OPE, lower-bound ranking, multi-sided guardrails, out-of-time validation, and interference-aware design to output launch-readiness classifications rather than single performance estimates, applied to RTB logs where a margin-gated floor policy is selected for va
citing papers explorer
-
Decision Support for Marketplace Policies under Incomplete Evidence: From Replay to Launch Readiness
A support-aware DSS integrates replay, OPE, lower-bound ranking, multi-sided guardrails, out-of-time validation, and interference-aware design to output launch-readiness classifications rather than single performance estimates, applied to RTB logs where a margin-gated floor policy is selected for va