ROCVMar 23

Foundation Models for Trajectory Planning in Autonomous Driving: A Review of Progress and Open Challenges

arXiv:2512.0002135.51 citationsh-index: 28Has Code
Predicted impact top 5% in RO · last 90 daysOriginality Synthesis-oriented
AI Analysis

It addresses the problem of improving trajectory planning for autonomous driving researchers and practitioners, but is incremental as it reviews existing progress.

This review examines the shift from hand-crafted methods to foundation models for trajectory planning in autonomous driving, covering 37 approaches and evaluating their design, capabilities, and limitations.

The emergence of multi-modal foundation models has markedly transformed the technology for autonomous driving, shifting away from conventional and mostly hand-crafted design choices towards unified, foundation-model-based approaches, capable of directly inferring motion trajectories from raw sensory inputs. This new class of methods can also incorporate natural language as an additional modality, with Vision-Language-Action (VLA) models serving as a representative example. In this review, we provide a comprehensive examination of such methods through a unifying taxonomy to critically evaluate their architectural design choices, methodological strengths, and their inherent capabilities and limitations. Our survey covers 37 recently proposed approaches that span the landscape of trajectory planning with foundation models. Furthermore, we assess these approaches with respect to the openness of their source code and datasets, offering valuable information to practitioners and researchers. We provide an accompanying webpage that catalogues the methods based on our taxonomy, available at: https://github.com/fiveai/FMs-for-driving-trajectories

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes