CV AI LGApr 11, 2025

On Background Bias of Post-Hoc Concept Embeddings in Computer Vision DNNs

Gesina Schwalbe, Georgii Mikriukov, Edgar Heinert, Stavros Gerolymatos, Mert Keser, Alois Knoll, Matthias Rottmann, Annika Mütze

arXiv:2504.08602v16.21 citationsh-index: 11Has CodexAI

Originality Incremental advance

AI Analysis

This work addresses a critical oversight in explainable AI for computer vision, revealing that current methods may be unreliable due to background biases, which is an incremental but important contribution to improving robustness.

The study investigated whether post-hoc concept-based explainable AI methods in computer vision capture background biases, finding that established techniques frequently exhibit such biases, including underperformance on road scenes, across multiple datasets and architectures.

The thriving research field of concept-based explainable artificial intelligence (C-XAI) investigates how human-interpretable semantic concepts embed in the latent spaces of deep neural networks (DNNs). Post-hoc approaches therein use a set of examples to specify a concept, and determine its embeddings in DNN latent space using data driven techniques. This proved useful to uncover biases between different target (foreground or concept) classes. However, given that the background is mostly uncontrolled during training, an important question has been left unattended so far: Are/to what extent are state-of-the-art, data-driven post-hoc C-XAI approaches themselves prone to biases with respect to their backgrounds? E.g., wild animals mostly occur against vegetation backgrounds, and they seldom appear on roads. Even simple and robust C-XAI methods might abuse this shortcut for enhanced performance. A dangerous performance degradation of the concept-corner cases of animals on the road could thus remain undiscovered. This work validates and thoroughly confirms that established Net2Vec-based concept segmentation techniques frequently capture background biases, including alarming ones, such as underperformance on road scenes. For the analysis, we compare 3 established techniques from the domain of background randomization on >50 concepts from 2 datasets, and 7 diverse DNN architectures. Our results indicate that even low-cost setups can provide both valuable insight and improved background robustness.

View on arXiv PDF Code

Similar