CVAIMay 24, 2024

Exposing Image Classifier Shortcuts with Counterfactual Frequency (CoF) Tables

arXiv:2405.15661v24 citationsh-index: 4
Originality Incremental advance
AI Analysis

This addresses the issue of labor-intensive shortcut detection in image classification for the explainable AI community, though it is incremental as it builds on existing instance-level explanation methods.

The paper tackles the problem of deep learning image classifiers using shortcuts, which are easy-to-learn patterns that fail to generalize, by introducing Counterfactual Frequency (CoF) tables to aggregate instance-level explanations into global insights and expose these shortcuts, demonstrating utility across several datasets.

The rise of deep learning in image classification has brought unprecedented accuracy but also highlighted a key issue: the use of 'shortcuts' by models. Such shortcuts are easy-to-learn patterns from the training data that fail to generalise to new data. Examples include the use of a copyright watermark to recognise horses, snowy background to recognise huskies, or ink markings to detect malignant skin lesions. The explainable AI (XAI) community has suggested using instance-level explanations to detect shortcuts without external data, but this requires the examination of many explanations to confirm the presence of such shortcuts, making it a labour-intensive process. To address these challenges, we introduce Counterfactual Frequency (CoF) tables, a novel approach that aggregates instance-based explanations into global insights, and exposes shortcuts. The aggregation implies the need for some semantic concepts to be used in the explanations, which we solve by labelling the segments of an image. We demonstrate the utility of CoF tables across several datasets, revealing the shortcuts learned from them.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes