CVMay 7, 2025

Are Synthetic Corruptions A Reliable Proxy For Real-World Corruptions?

Shashank Agnihotri, David Schader, Nico Sharei, Mehmet Ege Kaçar, Margret Keuper

arXiv:2505.04835v113.15 citationsh-index: 17Has Code

Originality Synthesis-oriented

AI Analysis

This addresses the resource-intensive challenge of collecting real-world data for robustness testing in computer vision, though it is incremental as it builds on existing benchmarking methods.

The study tackled the problem of evaluating deep learning model robustness to distribution shifts by comparing synthetic and real-world corruptions, finding a strong correlation in mean performance for semantic segmentation models, supporting synthetic corruptions as a reliable proxy.

Deep learning (DL) models are widely used in real-world applications but remain vulnerable to distribution shifts, especially due to weather and lighting changes. Collecting diverse real-world data for testing the robustness of DL models is resource-intensive, making synthetic corruptions an attractive alternative for robustness testing. However, are synthetic corruptions a reliable proxy for real-world corruptions? To answer this, we conduct the largest benchmarking study on semantic segmentation models, comparing performance on real-world corruptions and synthetic corruptions datasets. Our results reveal a strong correlation in mean performance, supporting the use of synthetic corruptions for robustness evaluation. We further analyze corruption-specific correlations, providing key insights to understand when synthetic corruptions succeed in representing real-world corruptions. Open-source Code: https://github.com/shashankskagnihotri/benchmarking_robustness/tree/segmentation_david/semantic_segmentation

View on arXiv PDF Code

Similar