LGApr 8

Predictive Representations for Skill Transfer in Reinforcement Learning

arXiv:2604.0701633.0
Predicted impact top 70% in LG · last 90 daysOriginality Highly original
AI Analysis

This work addresses the problem of scaling reinforcement learning by enabling skill transfer, which is incremental as it builds on existing state and action abstraction methods.

The paper tackles the challenge of knowledge transfer in reinforcement learning by introducing Outcome-Predictive State Representations (OPSRs) and OPSR-based skills, which enable agents to reuse learned skills in new tasks, speeding up learning considerably without pre-processing.

A key challenge in scaling up Reinforcement Learning is generalizing learned behaviour. Without the ability to carry forward acquired knowledge an agent is doomed to learn each task from scratch. In this paper we develop a new formalism for transfer by virtue of state abstraction. Based on task-independent, compact observations (outcomes) of the environment, we introduce Outcome-Predictive State Representations (OPSRs), agent-centered and task-independent abstractions that are made up of predictions of outcomes. We show formally and empirically that they have the potential for optimal but limited transfer, then overcome this trade-off by introducing OPSR-based skills, i.e. abstract actions (based on options) that can be reused between tasks as a result of state abstraction. In a series of empirical studies, we learn OPSR-based skills from demonstrations and show how they speed up learning considerably in entirely new and unseen tasks without any pre-processing. We believe that the framework introduced in this work is a promising step towards transfer in RL in general, and towards transfer through combining state and action abstraction specifically.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes