LG AIJan 27

From Observations to Events: Event-Aware World Model for Reinforcement Learning

Zhao-Han Peng, Shaohui Li, Zhi Li, Shulan Ruan, Yu Liu, You He

arXiv:2601.19336v12.71 citationsh-index: 3Has Code

Originality Incremental advance

AI Analysis

This addresses a key limitation in reinforcement learning for improving sample efficiency and robustness, though it appears incremental as it builds on existing world model architectures.

The paper tackles the problem of model-based reinforcement learning struggling with generalization across structurally similar scenes and spurious variations by proposing an Event-Aware World Model (EAWM) that learns event-aware representations from raw observations without handcrafted labels, resulting in performance boosts of 10%-45% over strong baselines across multiple benchmarks.

While model-based reinforcement learning (MBRL) improves sample efficiency by learning world models from raw observations, existing methods struggle to generalize across structurally similar scenes and remain vulnerable to spurious variations such as textures or color shifts. From a cognitive science perspective, humans segment continuous sensory streams into discrete events and rely on these key events for decision-making. Motivated by this principle, we propose the Event-Aware World Model (EAWM), a general framework that learns event-aware representations to streamline policy learning without requiring handcrafted labels. EAWM employs an automated event generator to derive events from raw observations and introduces a Generic Event Segmentor (GES) to identify event boundaries, which mark the start and end time of event segments. Through event prediction, the representation space is shaped to capture meaningful spatio-temporal transitions. Beyond this, we present a unified formulation of seemingly distinct world model architectures and show the broad applicability of our methods. Experiments on Atari 100K, Craftax 1M, and DeepMind Control 500K, DMC-GB2 500K demonstrate that EAWM consistently boosts the performance of strong MBRL baselines by 10%-45%, setting new state-of-the-art results across benchmarks. Our code is released at https://github.com/MarquisDarwin/EAWM.

View on arXiv PDF Code

Similar