CVMay 2, 2024

OmniDrive: A Holistic Vision-Language Dataset for Autonomous Driving with Counterfactual Reasoning

arXiv:2405.01533v2140 citationsh-index: 19CVPR
Originality Incremental advance
AI Analysis

This work addresses the problem of enhancing autonomous driving systems with better 3D reasoning for researchers and developers, though it appears incremental as it builds on existing vision-language models and datasets.

The authors tackled the challenge of extending vision-language models from 2D to 3D understanding for autonomous driving by proposing OmniDrive, a holistic dataset with counterfactual reasoning, which led to significant improvements on benchmarks like DriveLM Q&A and nuScenes open-loop planning.

The advances in vision-language models (VLMs) have led to a growing interest in autonomous driving to leverage their strong reasoning capabilities. However, extending these capabilities from 2D to full 3D understanding is crucial for real-world applications. To address this challenge, we propose OmniDrive, a holistic vision-language dataset that aligns agent models with 3D driving tasks through counterfactual reasoning. This approach enhances decision-making by evaluating potential scenarios and their outcomes, similar to human drivers considering alternative actions. Our counterfactual-based synthetic data annotation process generates large-scale, high-quality datasets, providing denser supervision signals that bridge planning trajectories and language-based reasoning. Futher, we explore two advanced OmniDrive-Agent frameworks, namely Omni-L and Omni-Q, to assess the importance of vision-language alignment versus 3D perception, revealing critical insights into designing effective LLM-agents. Significant improvements on the DriveLM Q\&A benchmark and nuScenes open-loop planning demonstrate the effectiveness of our dataset and methods.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes