ROAIOct 20, 2025

SimpleVSF: VLM-Scoring Fusion for Trajectory Prediction of End-to-End Autonomous Driving

arXiv:2510.17191v22 citationsh-index: 3
Originality Incremental advance
AI Analysis

This addresses the problem of robust and intelligent driving policies for autonomous vehicles, representing an incremental improvement with novel fusion techniques.

The paper tackles suboptimal decision-making in end-to-end autonomous driving by proposing SimpleVSF, a framework that fuses Vision-Language Model (VLM) capabilities with trajectory scoring, achieving state-of-the-art performance in the ICCV 2025 NAVSIM v2 challenge.

End-to-end autonomous driving has emerged as a promising paradigm for achieving robust and intelligent driving policies. However, existing end-to-end methods still face significant challenges, such as suboptimal decision-making in complex scenarios. In this paper,we propose SimpleVSF (Simple VLM-Scoring Fusion), a novel framework that enhances end-to-end planning by leveraging the cognitive capabilities of Vision-Language Models (VLMs) and advanced trajectory fusion techniques. We utilize the conventional scorers and the novel VLM-enhanced scorers. And we leverage a robust weight fusioner for quantitative aggregation and a powerful VLM-based fusioner for qualitative, context-aware decision-making. As the leading approach in the ICCV 2025 NAVSIM v2 End-to-End Driving Challenge, our SimpleVSF framework demonstrates state-of-the-art performance, achieving a superior balance between safety, comfort, and efficiency.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes