CVMLJan 20, 2023

Holistically Explainable Vision Transformers

arXiv:2301.08669v110 citationsh-index: 137
Originality Incremental advance
AI Analysis

This addresses the need for better interpretability in widely used transformer models, particularly for vision tasks, though it is incremental in method.

The authors tackled the problem of insufficient interpretability in Vision Transformers by proposing B-cos transformers, which provide holistic explanations through dynamic linear components, resulting in models that perform competitively on ImageNet.

Transformers increasingly dominate the machine learning landscape across many tasks and domains, which increases the importance for understanding their outputs. While their attention modules provide partial insight into their inner workings, the attention scores have been shown to be insufficient for explaining the models as a whole. To address this, we propose B-cos transformers, which inherently provide holistic explanations for their decisions. Specifically, we formulate each model component - such as the multi-layer perceptrons, attention layers, and the tokenisation module - to be dynamic linear, which allows us to faithfully summarise the entire transformer via a single linear transform. We apply our proposed design to Vision Transformers (ViTs) and show that the resulting models, dubbed Bcos-ViTs, are highly interpretable and perform competitively to baseline ViTs on ImageNet. Code will be made available soon.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes