CVFeb 13

The Constant Eye: Benchmarking and Bridging Appearance Robustness in Autonomous Driving

arXiv:2602.12563v1h-index: 5
Originality Highly original
AI Analysis

This addresses a critical robustness issue for autonomous driving systems, though it is incremental in focusing on appearance shifts rather than structural changes.

The paper tackles the problem of autonomous driving algorithms being fragile under appearance-based Out-of-Distribution conditions, such as weather and lighting, by establishing a benchmark to isolate these effects and proposing a perception interface that achieves exceptional zero-shot generalization across planning models without fine-tuning.

Despite rapid progress, autonomous driving algorithms remain notoriously fragile under Out-of-Distribution (OOD) conditions. We identify a critical decoupling failure in current research: the lack of distinction between appearance-based shifts, such as weather and lighting, and structural scene changes. This leaves a fundamental question unanswered: Is the planner failing because of complex road geometry, or simply because it is raining? To resolve this, we establish navdream, a high-fidelity robustness benchmark leveraging generative pixel-aligned style transfer. By creating a visual stress test with negligible geometric deviation, we isolate the impact of appearance on driving performance. Our evaluation reveals that existing planning algorithms often show significant degradation under OOD appearance conditions, even when the underlying scene structure remains consistent. To bridge this gap, we propose a universal perception interface leveraging a frozen visual foundation model (DINOv3). By extracting appearance-invariant features as a stable interface for the planner, we achieve exceptional zero-shot generalization across diverse planning paradigms, including regression-based, diffusion-based, and scoring-based models. Our plug-and-play solution maintains consistent performance across extreme appearance shifts without requiring further fine-tuning. The benchmark and code will be made available.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes