CVNov 15, 2024

Fill in the blanks: Rethinking Interpretability in vision

arXiv:2411.10273v1h-index: 3Has Code
Originality Incremental advance
AI Analysis

This work addresses the need for adaptable, model-agnostic explainability tools in specialized domains like medical imaging, though it appears incremental as it builds on existing XAI concepts.

The paper tackles the problem of interpretability in vision models by proposing a method that probes learned input structures through masked-image filling, revealing consistent patterns across standard datasets and pre-trained models.

Model interpretability is a key challenge that has yet to align with the advancements observed in contemporary state-of-the-art deep learning models. In particular, deep learning aided vision tasks require interpretability, in order for their adoption in more specialized domains such as medical imaging. Although the field of explainable AI (XAI) developed methods for interpreting vision models along with early convolutional neural networks, recent XAI research has mainly focused on assigning attributes via saliency maps. As such, these methods are restricted to providing explanations at a sample level, and many explainability methods suffer from low adaptability across a wide range of vision models. In our work, we re-think vision-model explainability from a novel perspective, to probe the general input structure that a model has learnt during its training. To this end, we ask the question: "How would a vision model fill-in a masked-image". Experiments on standard vision datasets and pre-trained models reveal consistent patterns, and could be intergrated as an additional model-agnostic explainability tool in modern machine-learning platforms. The code will be available at \url{https://github.com/BoTZ-TND/FillingTheBlanks.git}

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes