LGCVFeb 11, 2021

What does LIME really see in images?

arXiv:2102.06307v249 citations
AI Analysis

This work provides theoretical insights into LIME's behavior, which is important for researchers and practitioners using interpretability methods in AI, though it is incremental as it builds on existing methods.

The paper analyzes LIME, a popular interpretability method for computer vision, by deriving its theoretical limit explanation and uncovering a connection to integrated gradients, showing that LIME explanations approximate the sum of integrated gradients over superpixels.

The performance of modern algorithms on certain computer vision tasks such as object recognition is now close to that of humans. This success was achieved at the price of complicated architectures depending on millions of parameters and it has become quite challenging to understand how particular predictions are made. Interpretability methods propose to give us this understanding. In this paper, we study LIME, perhaps one of the most popular. On the theoretical side, we show that when the number of generated examples is large, LIME explanations are concentrated around a limit explanation for which we give an explicit expression. We further this study for elementary shape detectors and linear models. As a consequence of this analysis, we uncover a connection between LIME and integrated gradients, another explanation method. More precisely, the LIME explanations are similar to the sum of integrated gradients over the superpixels used in the preprocessing step of LIME.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes