MLLGMay 25, 2023

Interpretable Machine Learning based on Functional ANOVA Framework: Algorithms and Comparisons

arXiv:2305.15670v19 citations
Originality Incremental advance
AI Analysis

This work addresses the need for inherently interpretable models in machine learning, though it is incremental as it builds on existing fANOVA-based techniques like EBM and GAMI-Net.

The paper tackles the challenge of interpretable machine learning by proposing a new algorithm, GAMI-Lin-T, which uses linear fits within tree partitions and includes a new interaction filtering algorithm. Results on simulated and real datasets show that GAMI-Lin-T and GAMI-Net have comparable performances and generally outperform EBM.

In the early days of machine learning (ML), the emphasis was on developing complex algorithms to achieve best predictive performance. To understand and explain the model results, one had to rely on post hoc explainability techniques, which are known to have limitations. Recently, with the recognition that interpretability is just as important, researchers are compromising on small increases in predictive performance to develop algorithms that are inherently interpretable. While doing so, the ML community has rediscovered the use of low-order functional ANOVA (fANOVA) models that have been known in the statistical literature for some time. This paper starts with a description of challenges with post hoc explainability and reviews the fANOVA framework with a focus on main effects and second-order interactions. This is followed by an overview of two recently developed techniques: Explainable Boosting Machines or EBM (Lou et al., 2013) and GAMI-Net (Yang et al., 2021b). The paper proposes a new algorithm, called GAMI-Lin-T, that also uses trees like EBM, but it does linear fits instead of piecewise constants within the partitions. There are many other differences, including the development of a new interaction filtering algorithm. Finally, the paper uses simulated and real datasets to compare selected ML algorithms. The results show that GAMI-Lin-T and GAMI-Net have comparable performances, and both are generally better than EBM.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes