pith. sign in

arxiv: 1801.00048 · v1 · pith:PQG5GKSCnew · submitted 2017-12-29 · 💻 cs.SY · cs.AI· cs.SY· q-bio.NC

Characterizing optimal hierarchical policy inference on graphs via non-equilibrium thermodynamics

classification 💻 cs.SY cs.AIcs.SYq-bio.NC
keywords controlhierarchiesinferenceoptimalpolicyalgorithmdiscretegraphs
0
0 comments X
read the original abstract

Hierarchies are of fundamental interest in both stochastic optimal control and biological control due to their facilitation of a range of desirable computational traits in a control algorithm and the possibility that they may form a core principle of sensorimotor and cognitive control systems. However, a theoretically justified construction of state-space hierarchies over all spatial resolutions and their evolution through a policy inference process remains elusive. Here, a formalism for deriving such normative representations of discrete Markov decision processes is introduced in the context of graphs. The resulting hierarchies correspond to a hierarchical policy inference algorithm approximating a discrete gradient flow between state-space trajectory densities generated by the prior and optimal policies.

This paper has not been read by Pith yet.

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.