LGAug 10, 2012

Inverse Reinforcement Learning with Gaussian Process

arXiv:1208.2112v239 citations
Originality Incremental advance
AI Analysis

This work addresses the challenge of scaling inverse reinforcement learning for real-world applications, though it appears incremental as it builds on existing Bayesian frameworks.

The authors tackled the problem of inverse reinforcement learning in large or infinite state spaces by proposing a Gaussian process model with preference graphs, which demonstrated better accuracy and robustness compared to an established algorithm on small-scale numerical problems.

We present new algorithms for inverse reinforcement learning (IRL, or inverse optimal control) in convex optimization settings. We argue that finite-space IRL can be posed as a convex quadratic program under a Bayesian inference framework with the objective of maximum a posterior estimation. To deal with problems in large or even infinite state space, we propose a Gaussian process model and use preference graphs to represent observations of decision trajectories. Our method is distinguished from other approaches to IRL in that it makes no assumptions about the form of the reward function and yet it retains the promise of computationally manageable implementations for potential real-world applications. In comparison with an establish algorithm on small-scale numerical problems, our method demonstrated better accuracy in apprenticeship learning and a more robust dependence on the number of observations.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes