LGAIOct 23, 2024

Learning Versatile Skills with Curriculum Masking

arXiv:2410.17744v23 citationsh-index: 19NIPS
Originality Incremental advance
AI Analysis

This work addresses a bottleneck in offline reinforcement learning for researchers and practitioners by improving skill acquisition through curriculum learning, though it is incremental as it builds on existing masked prediction methods.

The paper tackles the problem of balancing skill complexity learning in masked prediction for offline RL by proposing CurrMask, a curriculum masking pretraining paradigm, which achieves superior zero-shot performance on skill prompting and goal-conditioned planning tasks, and competitive finetuning on offline RL tasks.

Masked prediction has emerged as a promising pretraining paradigm in offline reinforcement learning (RL) due to its versatile masking schemes, enabling flexible inference across various downstream tasks with a unified model. Despite the versatility of masked prediction, it remains unclear how to balance the learning of skills at different levels of complexity. To address this, we propose CurrMask, a curriculum masking pretraining paradigm for sequential decision making. Motivated by how humans learn by organizing knowledge in a curriculum, CurrMask adjusts its masking scheme during pretraining for learning versatile skills. Through extensive experiments, we show that CurrMask exhibits superior zero-shot performance on skill prompting tasks, goal-conditioned planning tasks, and competitive finetuning performance on offline RL tasks. Additionally, our analysis of training dynamics reveals that CurrMask gradually acquires skills of varying complexity by dynamically adjusting its masking scheme.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes