LGAIDSMLJun 9, 2020

Constrained episodic reinforcement learning in concave-convex and knapsack settings

arXiv:2006.05051v256 citations
AI Analysis

This work addresses constrained reinforcement learning for scenarios requiring theoretical guarantees and practical performance, though it appears incremental by extending beyond linear constraints and single-episode settings.

The paper tackles constrained episodic reinforcement learning by proposing an algorithm for settings with concave rewards and convex constraints, as well as hard knapsack constraints, and demonstrates that it significantly outperforms previous approaches in existing environments.

We propose an algorithm for tabular episodic reinforcement learning with constraints. We provide a modular analysis with strong theoretical guarantees for settings with concave rewards and convex constraints, and for settings with hard constraints (knapsacks). Most of the previous work in constrained reinforcement learning is limited to linear constraints, and the remaining work focuses on either the feasibility question or settings with a single episode. Our experiments demonstrate that the proposed algorithm significantly outperforms these approaches in existing constrained episodic environments.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes