LGAISYMLSep 11, 2019

Correlation Priors for Reinforcement Learning

arXiv:1909.05106v212 citations
Originality Incremental advance
AI Analysis

This work addresses a gap in principled solutions for discrete environments, benefiting researchers in reinforcement learning and related fields, though it appears incremental as it extends existing methods to new domains.

The paper tackles the problem of modeling correlated structures in discrete decision-making environments, presenting a Bayesian learning framework that yields superior predictive performance even with significantly smaller datasets.

Many decision-making problems naturally exhibit pronounced structures inherited from the characteristics of the underlying environment. In a Markov decision process model, for example, two distinct states can have inherently related semantics or encode resembling physical state configurations. This often implies locally correlated transition dynamics among the states. In order to complete a certain task in such environments, the operating agent usually needs to execute a series of temporally and spatially correlated actions. Though there exists a variety of approaches to capture these correlations in continuous state-action domains, a principled solution for discrete environments is missing. In this work, we present a Bayesian learning framework based on Pólya-Gamma augmentation that enables an analogous reasoning in such cases. We demonstrate the framework on a number of common decision-making related problems, such as imitation learning, subgoal extraction, system identification and Bayesian reinforcement learning. By explicitly modeling the underlying correlation structures of these problems, the proposed approach yields superior predictive performance compared to correlation-agnostic models, even when trained on data sets that are an order of magnitude smaller in size.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes