LGMLFeb 6, 2024

Probabilistic Shapley Value Modeling and Inference

arXiv:2402.04211v2h-index: 31
Originality Incremental advance
AI Analysis

This provides a probabilistic approach to feature attribution uncertainty for users of flexible predictive models, though it builds incrementally on existing Shapley value methods.

The authors tackled the challenge of modeling uncertainty in feature attributions for predictive models by proposing Probabilistic Shapley Inference (PSI), a framework that learns attribution distributions centered at Shapley values while maintaining competitive predictive performance on synthetic and real-world datasets.

We propose probabilistic Shapley inference (PSI), a novel probabilistic framework to model and infer sufficient statistics of feature attributions in flexible predictive models, via latent random variables whose mean recovers Shapley values. PSI enables efficient, scalable inference over input-to-output attributions, and their uncertainty, via a variational objective that jointly trains a predictive (regression or classification) model and its attribution distributions. To address the challenge of marginalizing over variable-length input feature subsets in Shapley value calculation, we introduce a masking-based neural network architecture, with a modular training and inference procedure. We evaluate PSI on synthetic and real-world datasets, showing that it achieves competitive predictive performance compared to strong baselines, while learning feature attribution distributions -- centered at Shapley values -- that reveal meaningful attribution uncertainty across data modalities.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes