pith. sign in

Sample Complexity for Markov Decision Processes and Stochastic Optimal Control with Static Risk Measures

1 Pith paper cite this work. Polarity classification is still indexing.

1 Pith paper citing it
abstract

We present an elementary state augmentation method for a class of static risk measure applied to the total cost for both Markov decision processes and stochastic optimal control, such that dynamic programming equations can be derived on the augmented space. Through this we discuss the sample complexities of these two problems for both finite-horizon and infinite-horizon settings. We demonstrate the application of the proposed approach through studying distributionally robust functional generated by $\phi$-divergences including conditional value-at-risk.

fields

math.OC 1

years

2026 1

verdicts

UNVERDICTED 1

representative citing papers

citing papers explorer

Showing 1 of 1 citing paper.

  • Sample Average Approximation for Distributionally Robust Optimization with $\phi$-divergences math.OC · 2026-04-12 · unverdicted · none · ref 2 · internal anchor

    For φ-divergences with superlinear growth, sample average approximation achieves P-independent sample complexity for worst-case expectation estimation depending only on φ's growth, ball radius and precision, with optimality via lower bounds; non-superlinear φ yields unbounded P-dependent complexity.