LGAICVJun 27, 2024

Efficient World Models with Context-Aware Tokenization

arXiv:2406.19320v129 citationsHas Code
Originality Incremental advance
AI Analysis

This addresses the problem of slow training in model-based RL for researchers and practitioners, offering a more efficient method that is incremental over prior attention-based approaches.

The paper tackles the computational inefficiency of transformer-based world models in reinforcement learning by proposing $\Delta$-IRIS, which uses a discrete autoencoder for stochastic deltas and an autoregressive transformer with continuous tokens, achieving new state-of-the-art results on the Crafter benchmark while being an order of magnitude faster to train.

Scaling up deep Reinforcement Learning (RL) methods presents a significant challenge. Following developments in generative modelling, model-based RL positions itself as a strong contender. Recent advances in sequence modelling have led to effective transformer-based world models, albeit at the price of heavy computations due to the long sequences of tokens required to accurately simulate environments. In this work, we propose $Δ$-IRIS, a new agent with a world model architecture composed of a discrete autoencoder that encodes stochastic deltas between time steps and an autoregressive transformer that predicts future deltas by summarizing the current state of the world with continuous tokens. In the Crafter benchmark, $Δ$-IRIS sets a new state of the art at multiple frame budgets, while being an order of magnitude faster to train than previous attention-based approaches. We release our code and models at https://github.com/vmicheli/delta-iris.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes