AISep 3, 2025

An Empirical Evaluation of Factors Affecting SHAP Explanation of Time Series Classification

arXiv:2509.03649v1h-index: 8
Originality Incremental advance
AI Analysis

This work addresses the computational bottleneck in explainable AI for time series classification, offering practical insights for researchers and practitioners.

The study tackled the problem of optimizing SHAP explanations for time series classification by evaluating segmentation strategies, finding that the number of segments matters more than the method and that equal-length segmentation often performs best, with a novel normalization technique improving attribution quality.

Explainable AI (XAI) has become an increasingly important topic for understanding and attributing the predictions made by complex Time Series Classification (TSC) models. Among attribution methods, SHapley Additive exPlanations (SHAP) is widely regarded as an excellent attribution method; but its computational complexity, which scales exponentially with the number of features, limits its practicality for long time series. To address this, recent studies have shown that aggregating features via segmentation, to compute a single attribution value for a group of consecutive time points, drastically reduces SHAP running time. However, the choice of the optimal segmentation strategy remains an open question. In this work, we investigated eight different Time Series Segmentation algorithms to understand how segment compositions affect the explanation quality. We evaluate these approaches using two established XAI evaluation methodologies: InterpretTime and AUC Difference. Through experiments on both Multivariate (MTS) and Univariate Time Series (UTS), we find that the number of segments has a greater impact on explanation quality than the specific segmentation method. Notably, equal-length segmentation consistently outperforms most of the custom time series segmentation algorithms. Furthermore, we introduce a novel attribution normalisation technique that weights segments by their length and we show that it consistently improves attribution quality.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes