CVLGNov 9, 2021

Self-Interpretable Model with TransformationEquivariant Interpretation

arXiv:2111.04927v129 citations
Originality Incremental advance
AI Analysis

This addresses the need for more reliable interpretation methods in computer vision, though it is incremental by focusing on specific transformations like translation and rotation.

The paper tackles the problem of generating robust and self-consistent interpretations for geometric transformations in self-interpretable models, proposing SITE which achieves comparable expressive power to black-box classifiers while providing faithful interpretations.

In this paper, we propose a self-interpretable model SITE with transformation-equivariant interpretations. We focus on the robustness and self-consistency of the interpretations of geometric transformations. Apart from the transformation equivariance, as a self-interpretable model, SITE has comparable expressive power as the benchmark black-box classifiers, while being able to present faithful and robust interpretations with high quality. It is worth noticing that although applied in most of the CNN visualization methods, the bilinear upsampling approximation is a rough approximation, which can only provide interpretations in the form of heatmaps (instead of pixel-wise). It remains an open question whether such interpretations can be direct to the input space (as shown in the MNIST experiments). Besides, we consider the translation and rotation transformations in our model. In future work, we will explore the robust interpretations under more complex transformations such as scaling and distortion. Moreover, we clarify that SITE is not limited to geometric transformation (that we used in the computer vision domain), and will explore SITEin other domains in future work.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes