SYAINCDec 29, 2017

Characterizing optimal hierarchical policy inference on graphs via non-equilibrium thermodynamics

arXiv:1801.00048v1
Originality Highly original
AI Analysis

This work addresses a foundational gap in stochastic optimal control and biological control, potentially impacting sensorimotor and cognitive systems.

The paper tackles the problem of constructing theoretically justified state-space hierarchies for Markov decision processes on graphs, introducing a formalism that yields a hierarchical policy inference algorithm approximating a discrete gradient flow between trajectory densities.

Hierarchies are of fundamental interest in both stochastic optimal control and biological control due to their facilitation of a range of desirable computational traits in a control algorithm and the possibility that they may form a core principle of sensorimotor and cognitive control systems. However, a theoretically justified construction of state-space hierarchies over all spatial resolutions and their evolution through a policy inference process remains elusive. Here, a formalism for deriving such normative representations of discrete Markov decision processes is introduced in the context of graphs. The resulting hierarchies correspond to a hierarchical policy inference algorithm approximating a discrete gradient flow between state-space trajectory densities generated by the prior and optimal policies.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes