ROAILGMar 10, 2025

Combating Partial Perception Deficit in Autonomous Driving with Multimodal LLM Commonsense

arXiv:2503.07020v11 citationsh-index: 5
Originality Incremental advance
AI Analysis

This work addresses safety and flexibility issues in autonomous driving for scenarios with perception deficits, representing an incremental improvement over existing methods.

The paper tackles the problem of partial perception deficits compromising autonomous vehicle safety by proposing LLM-RCO, a framework that integrates large language models for human-like driving commonsense, resulting in significantly improved driving performance in adverse conditions as demonstrated in CARLA simulator experiments.

Partial perception deficits can compromise autonomous vehicle safety by disrupting environmental understanding. Current protocols typically respond with immediate stops or minimal-risk maneuvers, worsening traffic flow and lacking flexibility for rare driving scenarios. In this paper, we propose LLM-RCO, a framework leveraging large language models to integrate human-like driving commonsense into autonomous systems facing perception deficits. LLM-RCO features four key modules: hazard inference, short-term motion planner, action condition verifier, and safety constraint generator. These modules interact with the dynamic driving environment, enabling proactive and context-aware control actions to override the original control policy of autonomous agents. To improve safety in such challenging conditions, we construct DriveLM-Deficit, a dataset of 53,895 video clips featuring deficits of safety-critical objects, complete with annotations for LLM-based hazard inference and motion planning fine-tuning. Extensive experiments in adverse driving conditions with the CARLA simulator demonstrate that systems equipped with LLM-RCO significantly improve driving performance, highlighting its potential for enhancing autonomous driving resilience against adverse perception deficits. Our results also show that LLMs fine-tuned with DriveLM-Deficit can enable more proactive movements instead of conservative stops in the context of perception deficits.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes