LGMEFeb 13, 2024

Feature Attribution with Necessity and Sufficiency via Dual-stage Perturbation Test for Causal Explanation

arXiv:2402.08845v46 citationsh-index: 20Has CodeICML
AI Analysis

This addresses the need for more discriminative feature attribution methods in explainable AI, offering a novel causal approach that improves upon existing techniques.

The paper tackles the problem of distinguishing feature contributions in machine learning models by introducing Feature Attribution with Necessity and Sufficiency (FANS), which uses a dual-stage perturbation test to compute importance based on causal reasoning, and demonstrates that it outperforms existing methods on six benchmarks.

We investigate the problem of explainability for machine learning models, focusing on Feature Attribution Methods (FAMs) that evaluate feature importance through perturbation tests. Despite their utility, FAMs struggle to distinguish the contributions of different features, when their prediction changes are similar after perturbation. To enhance FAMs' discriminative power, we introduce Feature Attribution with Necessity and Sufficiency (FANS), which find a neighborhood of the input such that perturbing samples within this neighborhood have a high Probability of being Necessity and Sufficiency (PNS) cause for the change in predictions, and use this PNS as the importance of the feature. Specifically, FANS compute this PNS via a heuristic strategy for estimating the neighborhood and a perturbation test involving two stages (factual and interventional) for counterfactual reasoning. To generate counterfactual samples, we use a resampling-based approach on the observed samples to approximate the required conditional distribution. We demonstrate that FANS outperforms existing attribution methods on six benchmarks. Please refer to the source code via \url{https://github.com/DMIRLAB-Group/FANS}.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes