LGAICVMay 20, 2024

Diffusion for World Modeling: Visual Details Matter in Atari

arXiv:2405.12399v2242 citationsh-index: 47NIPS
Originality Incremental advance
AI Analysis

This work addresses the challenge of sample-efficient and safe reinforcement learning for AI agents, representing an incremental improvement by applying diffusion models to world modeling.

The paper tackles the problem of world models in reinforcement learning by introducing DIAMOND, a diffusion-based world model that preserves visual details, achieving a mean human normalized score of 1.46 on the Atari 100k benchmark, which is a new best for agents trained entirely within a world model.

World models constitute a promising approach for training reinforcement learning agents in a safe and sample-efficient manner. Recent world models predominantly operate on sequences of discrete latent variables to model environment dynamics. However, this compression into a compact discrete representation may ignore visual details that are important for reinforcement learning. Concurrently, diffusion models have become a dominant approach for image generation, challenging well-established methods modeling discrete latents. Motivated by this paradigm shift, we introduce DIAMOND (DIffusion As a Model Of eNvironment Dreams), a reinforcement learning agent trained in a diffusion world model. We analyze the key design choices that are required to make diffusion suitable for world modeling, and demonstrate how improved visual details can lead to improved agent performance. DIAMOND achieves a mean human normalized score of 1.46 on the competitive Atari 100k benchmark; a new best for agents trained entirely within a world model. We further demonstrate that DIAMOND's diffusion world model can stand alone as an interactive neural game engine by training on static Counter-Strike: Global Offensive gameplay. To foster future research on diffusion for world modeling, we release our code, agents, videos and playable world models at https://diamond-wm.github.io.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes