CVAILGApr 11, 2025

On Background Bias of Post-Hoc Concept Embeddings in Computer Vision DNNs

arXiv:2504.08602v11 citationsh-index: 11xAI
Originality Incremental advance
AI Analysis

This work addresses a critical oversight in explainable AI for computer vision, revealing that current methods may be unreliable due to background biases, which is an incremental but important contribution to improving robustness.

The study investigated whether post-hoc concept-based explainable AI methods in computer vision capture background biases, finding that established techniques frequently exhibit such biases, including underperformance on road scenes, across multiple datasets and architectures.

The thriving research field of concept-based explainable artificial intelligence (C-XAI) investigates how human-interpretable semantic concepts embed in the latent spaces of deep neural networks (DNNs). Post-hoc approaches therein use a set of examples to specify a concept, and determine its embeddings in DNN latent space using data driven techniques. This proved useful to uncover biases between different target (foreground or concept) classes. However, given that the background is mostly uncontrolled during training, an important question has been left unattended so far: Are/to what extent are state-of-the-art, data-driven post-hoc C-XAI approaches themselves prone to biases with respect to their backgrounds? E.g., wild animals mostly occur against vegetation backgrounds, and they seldom appear on roads. Even simple and robust C-XAI methods might abuse this shortcut for enhanced performance. A dangerous performance degradation of the concept-corner cases of animals on the road could thus remain undiscovered. This work validates and thoroughly confirms that established Net2Vec-based concept segmentation techniques frequently capture background biases, including alarming ones, such as underperformance on road scenes. For the analysis, we compare 3 established techniques from the domain of background randomization on >50 concepts from 2 datasets, and 7 diverse DNN architectures. Our results indicate that even low-cost setups can provide both valuable insight and improved background robustness.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes