pith. sign in

arxiv: math/0408146 · v1 · submitted 2004-08-11 · 🧮 math.GM · cs.AI· cs.LG

Learning a Machine for the Decision in a Partially Observable Markov Universe

classification 🧮 math.GM cs.AIcs.LG
keywords markovlearningmethodobservableoptimalpartiallyuniverseviewpoint
0
0 comments X
read the original abstract

In this paper, we are interested in optimal decisions in a partially observable Markov universe. Our viewpoint departs from the dynamic programming viewpoint: we are directly approximating an optimal strategic tree depending on the observation. This approximation is made by means of a parameterized probabilistic law. In this paper, a particular family of hidden Markov models, with input and output, is considered as a learning framework. A method for optimizing the parameters of these HMMs is proposed and applied. This optimization method is based on the cross-entropic principle.

This paper has not been read by Pith yet.

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.