pith. sign in

arxiv: 1802.07243 · v1 · pith:P2ES6KRInew · submitted 2018-02-20 · 🧮 math.OC

Value iteration for approximate dynamic programming under convexity

classification 🧮 math.OC
keywords fixediterationoriginalundervalueboundingconvexityform
0
0 comments X
read the original abstract

This paper studies value iteration for infinite horizon contracting Markov decision processes under convexity assumptions and when the state space is uncountable. The original value iteration is replaced with a more tractable form and the fixed points from the modified Bellman operators will be shown to converge uniformly on compacts sets to their original counterparts. This holds under various sampling approaches for the random disturbances. Moreover, this paper will present conditions in which these fixed points form monotone sequences of lower bounding or upper bounding functions for the original fixed point. This approach is then demonstrated numerically on a perpetual Bermudan put option.

This paper has not been read by Pith yet.

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.