CVMay 7, 2025

Are Synthetic Corruptions A Reliable Proxy For Real-World Corruptions?

arXiv:2505.04835v15 citationsh-index: 17Has Code
Originality Synthesis-oriented
AI Analysis

This addresses the resource-intensive challenge of collecting real-world data for robustness testing in computer vision, though it is incremental as it builds on existing benchmarking methods.

The study tackled the problem of evaluating deep learning model robustness to distribution shifts by comparing synthetic and real-world corruptions, finding a strong correlation in mean performance for semantic segmentation models, supporting synthetic corruptions as a reliable proxy.

Deep learning (DL) models are widely used in real-world applications but remain vulnerable to distribution shifts, especially due to weather and lighting changes. Collecting diverse real-world data for testing the robustness of DL models is resource-intensive, making synthetic corruptions an attractive alternative for robustness testing. However, are synthetic corruptions a reliable proxy for real-world corruptions? To answer this, we conduct the largest benchmarking study on semantic segmentation models, comparing performance on real-world corruptions and synthetic corruptions datasets. Our results reveal a strong correlation in mean performance, supporting the use of synthetic corruptions for robustness evaluation. We further analyze corruption-specific correlations, providing key insights to understand when synthetic corruptions succeed in representing real-world corruptions. Open-source Code: https://github.com/shashankskagnihotri/benchmarking_robustness/tree/segmentation_david/semantic_segmentation

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes