AISYAug 20, 2024

Hybrid Recurrent Models Support Emergent Descriptions for Hierarchical Planning and Control

arXiv:2408.10970v1h-index: 4
Originality Incremental advance
AI Analysis

This work addresses the challenge of flexible abstraction learning for hierarchical planning in AI, offering a novel integration of methods for continuous control tasks.

The paper tackled the problem of learning discrete abstractions for continuous control by proposing a hierarchical model-based algorithm that uses a recurrent switching linear dynamical system (rSLDS) to enable planning with temporally-abstracted sub-goals and enhanced exploration. It demonstrated success on the sparse Continuous Mountain Car task, achieving fast system identification and non-trivial planning.

An open problem in artificial intelligence is how systems can flexibly learn discrete abstractions that are useful for solving inherently continuous problems. Previous work has demonstrated that a class of hybrid state-space model known as recurrent switching linear dynamical systems (rSLDS) discover meaningful behavioural units via the piecewise linear decomposition of complex continuous dynamics (Linderman et al., 2016). Furthermore, they model how the underlying continuous states drive these discrete mode switches. We propose that the rich representations formed by an rSLDS can provide useful abstractions for planning and control. We present a novel hierarchical model-based algorithm inspired by Active Inference in which a discrete MDP sits above a low-level linear-quadratic controller. The recurrent transition dynamics learned by the rSLDS allow us to (1) specify temporally-abstracted sub-goals in a method reminiscent of the options framework, (2) lift the exploration into discrete space allowing us to exploit information-theoretic exploration bonuses and (3) `cache' the approximate solutions to low-level problems in the discrete planner. We successfully apply our model to the sparse Continuous Mountain Car task, demonstrating fast system identification via enhanced exploration and non-trivial planning through the delineation of abstract sub-goals.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes