CVLGNov 22, 2024

Derivative-Free Diffusion Manifold-Constrained Gradient for Unified XAI

arXiv:2411.15265v22 citationsh-index: 22CVPR
Originality Incremental advance
AI Analysis

This addresses the problem of improving explainability for neural networks, particularly in image-based models, by overcoming issues like white-box access requirements and adversarial vulnerability, though it is incremental as it builds on existing gradient-based and manifold techniques.

The paper tackled the shortcomings of gradient-based explainability methods by introducing FreeMCG, a derivative-free approach that projects gradients onto the data manifold using ensemble Kalman filters and diffusion models, achieving state-of-the-art results in counterfactual generation and feature attribution.

Gradient-based methods are a prototypical family of explainability techniques, especially for image-based models. Nonetheless, they have several shortcomings in that they (1) require white-box access to models, (2) are vulnerable to adversarial attacks, and (3) produce attributions that lie off the image manifold, leading to explanations that are not actually faithful to the model and do not align well with human perception. To overcome these challenges, we introduce Derivative-Free Diffusion Manifold-Constrainted Gradients (FreeMCG), a novel method that serves as an improved basis for explainability of a given neural network than the traditional gradient. Specifically, by leveraging ensemble Kalman filters and diffusion models, we derive a derivative-free approximation of the model's gradient projected onto the data manifold, requiring access only to the model's outputs. We demonstrate the effectiveness of FreeMCG by applying it to both counterfactual generation and feature attribution, which have traditionally been treated as distinct tasks. Through comprehensive evaluation on both tasks, counterfactual explanation and feature attribution, we show that our method yields state-of-the-art results while preserving the essential properties expected of XAI tools.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes