CV AIMar 22, 2025

Multi-modality Anomaly Segmentation on the Road

Heng Gao, Zhuolin He, Shoumeng Qiu, Xiangyang Xue, Jian Pu

arXiv:2503.17712v1h-index: 8Has Code

Originality Incremental advance

AI Analysis

This work addresses safety-critical anomaly detection in autonomous driving, though it is incremental as it builds on existing uni-modal methods by adding multi-modality.

The paper tackles the problem of false positives in anomaly segmentation for autonomous driving by introducing MMRAS+, a multi-modal framework that uses text encoders to reduce high anomaly scores in non-anomalous regions, achieving superior performance on datasets like RoadAnomaly, SMIYC, and Fishyscapes.

Semantic segmentation allows autonomous driving cars to understand the surroundings of the vehicle comprehensively. However, it is also crucial for the model to detect obstacles that may jeopardize the safety of autonomous driving systems. Based on our experiments, we find that current uni-modal anomaly segmentation frameworks tend to produce high anomaly scores for non-anomalous regions in images. Motivated by this empirical finding, we develop a multi-modal uncertainty-based anomaly segmentation framework, named MMRAS+, for autonomous driving systems. MMRAS+ effectively reduces the high anomaly outputs of non-anomalous classes by introducing text-modal using the CLIP text encoder. Indeed, MMRAS+ is the first multi-modal anomaly segmentation solution for autonomous driving. Moreover, we develop an ensemble module to further boost the anomaly segmentation performance. Experiments on RoadAnomaly, SMIYC, and Fishyscapes validation datasets demonstrate the superior performance of our method. The code is available in https://github.com/HengGao12/MMRAS_plus.

View on arXiv PDF Code

Similar