CLAILGNov 21, 2024

Understanding World or Predicting Future? A Comprehensive Survey of World Models

arXiv:2411.14499v3158 citationsh-index: 34Has CodeACM Computing Surveys
Originality Synthesis-oriented
AI Analysis

It provides a systematic overview for researchers and practitioners in AI, but is incremental as it synthesizes existing literature without new results.

This survey reviews world models, categorizing them as tools for understanding the present world or predicting future dynamics, and examines their applications in domains like generative games and autonomous driving.

The concept of world models has garnered significant attention due to advancements in multimodal large language models such as GPT-4 and video generation models such as Sora, which are central to the pursuit of artificial general intelligence. This survey offers a comprehensive review of the literature on world models. Generally, world models are regarded as tools for either understanding the present state of the world or predicting its future dynamics. This review presents a systematic categorization of world models, emphasizing two primary functions: (1) constructing internal representations to understand the mechanisms of the world, and (2) predicting future states to simulate and guide decision-making. Initially, we examine the current progress in these two categories. We then explore the application of world models in key domains, including generative games, autonomous driving, robotics, and social simulacra, with a focus on how each domain utilizes these aspects. Finally, we outline key challenges and provide insights into potential future research directions. We summarize the representative papers along with their code repositories in https://github.com/tsinghua-fib-lab/World-Model.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes