Cross-Entropic Learning of a Machine for the Decision in a Partially Observable Universe
classification
🧮 math.OC
cs.AIcs.LGcs.NEcs.ROmath.STstat.TH
keywords
observablepartiallyuniversecross-entropicdecisionlearningmachinemarkov
read the original abstract
Revision of the paper previously entitled "Learning a Machine for the Decision in a Partially Observable Markov Universe" In this paper, we are interested in optimal decisions in a partially observable universe. Our approach is to directly approximate an optimal strategic tree depending on the observation. This approximation is made by means of a parameterized probabilistic law. A particular family of hidden Markov models, with input \emph{and} output, is considered as a model of policy. A method for optimizing the parameters of these HMMs is proposed and applied. This optimization is based on the cross-entropic principle for rare events simulation developed by Rubinstein.
This paper has not been read by Pith yet.
discussion (0)
Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.