CVAIOct 14, 2024

DrivingDojo Dataset: Advancing Interactive and Knowledge-Enriched Driving World Model

arXiv:2410.10738v130 citationsh-index: 23NIPS
Originality Synthesis-oriented
AI Analysis

This addresses a data bottleneck for researchers developing driving world models, though it is incremental as it focuses on dataset creation rather than a new method.

The authors tackled the limited video diversity in driving datasets by introducing DrivingDojo, a dataset designed for training interactive world models, which improved action-controlled future predictions as demonstrated on a new benchmark.

Driving world models have gained increasing attention due to their ability to model complex physical dynamics. However, their superb modeling capability is yet to be fully unleashed due to the limited video diversity in current driving datasets. We introduce DrivingDojo, the first dataset tailor-made for training interactive world models with complex driving dynamics. Our dataset features video clips with a complete set of driving maneuvers, diverse multi-agent interplay, and rich open-world driving knowledge, laying a stepping stone for future world model development. We further define an action instruction following (AIF) benchmark for world models and demonstrate the superiority of the proposed dataset for generating action-controlled future predictions.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes