MLAILGAug 25, 2020

Looking Deeper into Tabular LIME

arXiv:2008.11092v335 citations
Originality Incremental advance
AI Analysis

This work addresses the interpretability of black-box models in machine learning, particularly for tabular data, by analyzing LIME's behavior, though it is incremental as it builds on existing methods without introducing new paradigms.

The paper provides a theoretical analysis of Tabular LIME, proving that its interpretable coefficients can be explicitly computed in the large sample limit and offering insights for functions with algebraic structures like linear or multiplicative forms, while revealing artifacts in partition-based regressors that can lead to misleading explanations.

In this paper, we present a thorough theoretical analysis of the default implementation of LIME in the case of tabular data. We prove that in the large sample limit, the interpretable coefficients provided by Tabular LIME can be computed in an explicit way as a function of the algorithm parameters and some expectation computations related to the black-box model. When the function to explain has some nice algebraic structure (linear, multiplicative, or sparsely depending on a subset of the coordinates), our analysis provides interesting insights into the explanations provided by LIME. These can be applied to a range of machine learning models including Gaussian kernels or CART random forests. As an example, for linear functions we show that LIME has the desirable property to provide explanations that are proportional to the coefficients of the function to explain and to ignore coordinates that are not used by the function to explain. For partition-based regressors, on the other side, we show that LIME produces undesired artifacts that may provide misleading explanations.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes