CVAug 1, 2024

DECIDER: Leveraging Foundation Model Priors for Improved Model Failure Detection and Explanation

arXiv:2408.00331v16 citationsh-index: 12Has Code
AI Analysis

This addresses the need for reliable failure detection in deployed models, particularly for image classification, though it is incremental as it builds on existing methods for debiasing and explanation.

The paper tackles the problem of detecting when machine learning models are likely to fail on inputs by proposing DECIDER, which leverages foundation models to debias classifiers and detect failures, achieving state-of-the-art performance with significant improvements in metrics like Matthews correlation coefficient and recall.

Reliably detecting when a deployed machine learning model is likely to fail on a given input is crucial for ensuring safe operation. In this work, we propose DECIDER (Debiasing Classifiers to Identify Errors Reliably), a novel approach that leverages priors from large language models (LLMs) and vision-language models (VLMs) to detect failures in image classification models. DECIDER utilizes LLMs to specify task-relevant core attributes and constructs a ``debiased'' version of the classifier by aligning its visual features to these core attributes using a VLM, and detects potential failure by measuring disagreement between the original and debiased models. In addition to proactively identifying samples on which the model would fail, DECIDER also provides human-interpretable explanations for failure through a novel attribute-ablation strategy. Through extensive experiments across diverse benchmarks spanning subpopulation shifts (spurious correlations, class imbalance) and covariate shifts (synthetic corruptions, domain shifts), DECIDER consistently achieves state-of-the-art failure detection performance, significantly outperforming baselines in terms of the overall Matthews correlation coefficient as well as failure and success recall. Our codes can be accessed at~\url{https://github.com/kowshikthopalli/DECIDER/}

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes