LG AIAug 20, 2025

Exact Shapley Attributions in Quadratic-time for FANOVA Gaussian Processes

Majid Mohammadi, Krikamol Muandet, Ilaria Tiddi, Annette Ten Teije, Siu Lun Chau

Oxford

arXiv:2508.14499v117.99 citationsh-index: 28

Originality Highly original

AI Analysis

This work addresses a computational bottleneck for researchers and practitioners using Shapley values in explainable AI, offering more scalable and uncertainty-aware explanations for structured probabilistic models.

The paper tackles the exponential computational scaling of exact Shapley value attributions for feature importance in machine learning, particularly for probabilistic models like Gaussian processes, by demonstrating that for FANOVA Gaussian processes, exact Shapley attributions for both local and global explanations can be computed in quadratic time.

Shapley values are widely recognized as a principled method for attributing importance to input features in machine learning. However, the exact computation of Shapley values scales exponentially with the number of features, severely limiting the practical application of this powerful approach. The challenge is further compounded when the predictive model is probabilistic - as in Gaussian processes (GPs) - where the outputs are random variables rather than point estimates, necessitating additional computational effort in modeling higher-order moments. In this work, we demonstrate that for an important class of GPs known as FANOVA GP, which explicitly models all main effects and interactions, *exact* Shapley attributions for both local and global explanations can be computed in *quadratic time*. For local, instance-wise explanations, we define a stochastic cooperative game over function components and compute the exact stochastic Shapley value in quadratic time only, capturing both the expected contribution and uncertainty. For global explanations, we introduce a deterministic, variance-based value function and compute exact Shapley values that quantify each feature's contribution to the model's overall sensitivity. Our methods leverage a closed-form (stochastic) Möbius representation of the FANOVA decomposition and introduce recursive algorithms, inspired by Newton's identities, to efficiently compute the mean and variance of Shapley values. Our work enhances the utility of explainable AI, as demonstrated by empirical studies, by providing more scalable, axiomatically sound, and uncertainty-aware explanations for predictions generated by structured probabilistic models.

View on arXiv PDF

Similar