LGAICLMay 7, 2024

Policy Learning with a Language Bottleneck

arXiv:2405.04118v24 citationsh-index: 66Has Code
Originality Incremental advance
AI Analysis

This addresses the issue of opaque AI decision-making for users in applications like robotics and gaming, though it appears incremental by building on existing language-model and policy-learning methods.

The paper tackles the problem of AI systems lacking human-like generalization and interpretability by introducing Policy Learning with a Language Bottleneck (PLLB), a framework that enables agents to generate linguistic rules to guide policies, resulting in more interpretable and generalizable behaviors across five diverse tasks.

Modern AI systems such as self-driving cars and game-playing agents achieve superhuman performance, but often lack human-like generalization, interpretability, and inter-operability with human users. Inspired by the rich interactions between language and decision-making in humans, we introduce Policy Learning with a Language Bottleneck (PLLB), a framework enabling AI agents to generate linguistic rules that capture the high-level strategies underlying rewarding behaviors. PLLB alternates between a *rule generation* step guided by language models, and an *update* step where agents learn new policies guided by rules, even when a rule is insufficient to describe an entire complex policy. Across five diverse tasks, including a two-player signaling game, maze navigation, image reconstruction, and robot grasp planning, we show that PLLB agents are not only able to learn more interpretable and generalizable behaviors, but can also share the learned rules with human users, enabling more effective human-AI coordination. We provide source code for our experiments at https://github.com/meghabyte/bottleneck .

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes