LGMLSep 11, 2020

TREX: Tree-Ensemble Representer-Point Explanations

arXiv:2009.05530v37 citations
Originality Incremental advance
AI Analysis

This work addresses the need for interpretable machine learning in tree ensembles, offering a novel method for instance-attribution explanations that is incremental in adapting deep learning techniques to non-differentiable models.

The paper tackles the problem of identifying influential training examples for tree ensemble predictions by introducing TREX, an explanation system that builds a surrogate model using a custom kernel, achieving more effective dataset debugging and faster runtime than previous methods.

How can we identify the training examples that contribute most to the prediction of a tree ensemble? In this paper, we introduce TREX, an explanation system that provides instance-attribution explanations for tree ensembles, such as random forests and gradient boosted trees. TREX builds on the representer point framework previously developed for explaining deep neural networks. Since tree ensembles are non-differentiable, we define a kernel that captures the structure of the specific tree ensemble. By using this kernel in kernel logistic regression or a support vector machine, TREX builds a surrogate model that approximates the original tree ensemble. The weights in the kernel expansion of the surrogate model are used to define the global or local importance of each training example. Our experiments show that TREX's surrogate model accurately approximates the tree ensemble; its global importance weights are more effective in dataset debugging than the previous state-of-the-art; its explanations identify the most influential samples better than alternative methods under the remove and retrain evaluation framework; it runs orders of magnitude faster than alternative methods; and its local explanations can identify and explain errors due to domain mismatch.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes