LGOct 17, 2025

Zero-shot World Models via Search in Memory

arXiv:2510.16123v1h-index: 4
Originality Incremental advance
AI Analysis

This addresses the challenge of sample efficiency in reinforcement learning by providing a zero-shot alternative to training-based world models, though it is incremental as it builds on existing methods like Dreamer and PlaNet.

The paper tackled the problem of approximating world models without training by using similarity search and stochastic representations, achieving comparable performance to training-based models in latent reconstruction and image similarity, with stronger long-horizon prediction on diverse environments.

World Models have vastly permeated the field of Reinforcement Learning. Their ability to model the transition dynamics of an environment have greatly improved sample efficiency in online RL. Among them, the most notorious example is Dreamer, a model that learns to act in a diverse set of image-based environments. In this paper, we leverage similarity search and stochastic representations to approximate a world model without a training procedure. We establish a comparison with PlaNet, a well-established world model of the Dreamer family. We evaluate the models on the quality of latent reconstruction and on the perceived similarity of the reconstructed image, on both next-step and long horizon dynamics prediction. The results of our study demonstrate that a search-based world model is comparable to a training based one in both cases. Notably, our model show stronger performance in long-horizon prediction with respect to the baseline on a range of visually different environments.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes