ROAIMay 30, 2025

RealDrive: Retrieval-Augmented Driving with Diffusion Models

arXiv:2505.24808v11 citationsh-index: 24
Originality Incremental advance
AI Analysis

This addresses safety and generalization issues in autonomous driving planners, though it is incremental as it builds on existing retrieval-augmented and diffusion-based methods.

The paper tackles the problem of rare, safety-critical scenarios and limited controllability in learning-based driving planners by proposing RealDrive, a retrieval-augmented generation framework that uses diffusion models, resulting in a 40% reduction in collision rate on the Waymo Open Motion dataset.

Learning-based planners generate natural human-like driving behaviors by learning to reason about nuanced interactions from data, overcoming the rigid behaviors that arise from rule-based planners. Nonetheless, data-driven approaches often struggle with rare, safety-critical scenarios and offer limited controllability over the generated trajectories. To address these challenges, we propose RealDrive, a Retrieval-Augmented Generation (RAG) framework that initializes a diffusion-based planning policy by retrieving the most relevant expert demonstrations from the training dataset. By interpolating between current observations and retrieved examples through a denoising process, our approach enables fine-grained control and safe behavior across diverse scenarios, leveraging the strong prior provided by the retrieved scenario. Another key insight we produce is that a task-relevant retrieval model trained with planning-based objectives results in superior planning performance in our framework compared to a task-agnostic retriever. Experimental results demonstrate improved generalization to long-tail events and enhanced trajectory diversity compared to standard learning-based planners -- we observe a 40% reduction in collision rate on the Waymo Open Motion dataset with RAG.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes