CVAIMar 22, 2025

Multi-modality Anomaly Segmentation on the Road

arXiv:2503.17712v1h-index: 8Has Code
Originality Incremental advance
AI Analysis

This work addresses safety-critical anomaly detection in autonomous driving, though it is incremental as it builds on existing uni-modal methods by adding multi-modality.

The paper tackles the problem of false positives in anomaly segmentation for autonomous driving by introducing MMRAS+, a multi-modal framework that uses text encoders to reduce high anomaly scores in non-anomalous regions, achieving superior performance on datasets like RoadAnomaly, SMIYC, and Fishyscapes.

Semantic segmentation allows autonomous driving cars to understand the surroundings of the vehicle comprehensively. However, it is also crucial for the model to detect obstacles that may jeopardize the safety of autonomous driving systems. Based on our experiments, we find that current uni-modal anomaly segmentation frameworks tend to produce high anomaly scores for non-anomalous regions in images. Motivated by this empirical finding, we develop a multi-modal uncertainty-based anomaly segmentation framework, named MMRAS+, for autonomous driving systems. MMRAS+ effectively reduces the high anomaly outputs of non-anomalous classes by introducing text-modal using the CLIP text encoder. Indeed, MMRAS+ is the first multi-modal anomaly segmentation solution for autonomous driving. Moreover, we develop an ensemble module to further boost the anomaly segmentation performance. Experiments on RoadAnomaly, SMIYC, and Fishyscapes validation datasets demonstrate the superior performance of our method. The code is available in https://github.com/HengGao12/MMRAS_plus.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes