LGHCDGMay 16, 2024

Manifold Integrated Gradients: Riemannian Geometry for Feature Attribution

arXiv:2405.09800v117 citationsh-index: 3ICML
Originality Incremental advance
AI Analysis

This addresses noisy visualizations and adversarial vulnerabilities in feature attribution for deep learning models, though it is incremental.

The paper tackled the reliability issues of Integrated Gradients by adapting it to align with the data manifold's geometry, resulting in more intuitive explanations and increased robustness to attacks.

In this paper, we dive into the reliability concerns of Integrated Gradients (IG), a prevalent feature attribution method for black-box deep learning models. We particularly address two predominant challenges associated with IG: the generation of noisy feature visualizations for vision models and the vulnerability to adversarial attributional attacks. Our approach involves an adaptation of path-based feature attribution, aligning the path of attribution more closely to the intrinsic geometry of the data manifold. Our experiments utilise deep generative models applied to several real-world image datasets. They demonstrate that IG along the geodesics conforms to the curved geometry of the Riemannian data manifold, generating more perceptually intuitive explanations and, subsequently, substantially increasing robustness to targeted attributional attacks.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes