pith. sign in

Off-policy estimation with adaptively collected data: the power of online learning , isbn =

1 Pith paper cite this work. Polarity classification is still indexing.

1 Pith paper citing it

fields

stat.ME 1

years

2026 1

verdicts

UNVERDICTED 1

representative citing papers

Anytime-valid Optimal Policy Identification

stat.ME · 2026-06-16 · unverdicted · novelty 6.0

Constructs a time-indexed set S_t retaining the true optimal policy uniformly over time with high probability, enabling early stopping with sample complexity O((log |Π| + log log(1/Δ_min))/Δ_min²) when the optimum is unique.

citing papers explorer

Showing 1 of 1 citing paper.

  • Anytime-valid Optimal Policy Identification stat.ME · 2026-06-16 · unverdicted · none · ref 3

    Constructs a time-indexed set S_t retaining the true optimal policy uniformly over time with high probability, enabling early stopping with sample complexity O((log |Π| + log log(1/Δ_min))/Δ_min²) when the optimum is unique.