ROMay 25

Decision-Making with Lightweight Confidence-Aware Language Model for Autonomous Driving

Ruoyu Yao, Ruiguo Zhong, Pei Liu, Mingxing Peng, Rui Yang, Jun Ma

arXiv:2605.2539377.7

Predicted impact top 28% in RO · last 90 daysOriginality Incremental advance

AI Analysis

For autonomous driving systems, this work enables efficient deployment of language-model-based decision-making with improved performance in regular and long-tail scenarios.

The paper proposes a lightweight confidence-aware language model for autonomous driving that achieves state-of-the-art success rates on the nuPlan benchmark while maintaining low inference latency, addressing the computational overhead of large models.

Large Language Models (LLMs) and Multimodal LLMs (MLLMs) have demonstrated immense potential in autonomous driving (AD) by offering human-like reasoning and open-world generalization. However, the excessive computational overhead and high inference latency of these massive models severely hinder their deployment in resource-constrained AD systems. To address this challenge, we propose a novel decision-making framework utilizing a lightweight confidence-aware language model, which bridges the gap between complex multimodal intention reasoning and efficient inference. Specifically, we design a multi-agent collaborative workflow, comprising action voting, confidence assessment, and summarization agents, to generate high-quality, confidence-annotated decision demonstrations via explicit Chain-of-Thought (CoT) reasoning. These demonstrations are then distilled into a lightweight language model featuring a dual-head architecture, enabling the joint prediction of decision probabilities and the generation of textual rationales. The distillation is realized via a confidence-aware fine-tuning strategy coupled with Retrieval Augmented Generation (RAG) to enhance the model's adaptability and data efficiency. Comprehensive closed-loop experiments on the nuPlan benchmark demonstrate that our approach achieves state-of-the-art (SOTA) success rates in both regular and long-tail scenarios while maintaining low inference latency.

View on arXiv PDF

Similar