AIMay 11

Interpretable Machine Learning for Football Performance Analysis: Evidence of Limited Transferability from Elite Leagues to University Competition

Yu-Fang Tsai, Yu-Jen Chen, Kok-Hua Tan, Sheng-Chieh Huang, You-Ying Ji, Yu-Lun Chen, Chun-Yi Wang, Chien-Ming Hsu

arXiv:2605.107960.6

AI Analysis

For sports analytics practitioners, this work highlights that interpretable machine learning models trained on elite data may not reliably transfer to lower competition levels, challenging assumptions of domain transferability.

This study investigates whether performance determinants learned from elite football leagues transfer to university-level football. Results show that elite leagues exhibit stable performance hierarchies, while university football shows substantial reordering and reduced explanation stability, indicating that interpretability robustness is domain-dependent.

Machine learning has become increasingly prevalent in football performance analysis, yet most studies prioritize predictive accuracy while implicitly assuming that learned performance determinants and their interpretations are transferable across competition levels. Whether interpretability remains reliable under domain shift-from elite to university football remains largely unexplored. This study investigates whether performance determinants learned from elite competitions are structurally transferable to university-level football and whether their interpretations remain robust under domain shift. Models were trained on large-scale event data from the top five European leagues and applied to university football data from National Tsing Hua University (NTHU) using an identical feature space. Random Forest and Multilayer Perceptron models were interpreted using SHapley Additive exPlanations (SHAP) and Counterfactual Impact Score (CIS). Across five experiments, elite football exhibited a stable and consistent hierarchy of performance determinants across leagues, models, and explanation methods. In contrast, NTHU university football showed substantial reordering of key indicators, reduced explanation stability, weaker structural agreement with elite domains, and increased sensitivity to explanation method. These findings suggest that interpretability robustness is domain-dependent. Rather than reflecting methodological limitations alone, instability in explanations under domain shift may serve as a diagnostic signal of structural ambiguity in the target domain.

View on arXiv PDF

Similar