CV · AI · Apr 30, 2019

Interpretation of Feature Space using Multi-Channel Attentional Sub-Networks

arXiv:1904.13078v1 · 3 citations
Originality: Incremental advance
AI Analysis

This paper addresses interpretability challenges in CNNs for researchers and practitioners, though it appears incremental relative to existing visualization methods.

The paper tackles the problem of interpreting convolutional neural networks by developing a multi-channel attention mechanism that generates distinct attention masks for each feature and attribute, providing deeper insight into feature space while maintaining accurate multi-attribute recognition on benchmark datasets.

Convolutional Neural Networks have achieved impressive results in various tasks, but interpreting their internal mechanism is a challenging problem. To tackle this problem, we exploit a multi-channel attention mechanism in feature space. Our network architecture allows us to obtain an attention mask for each feature, while existing CNN visualization methods provide only a common attention mask for all features. We apply the proposed multi-channel attention mechanism to a multi-attribute recognition task. We can obtain a different attention mask for each feature and for each attribute. These analyses give us deeper insight into the feature space of CNNs. The experimental results on the benchmark dataset show that the proposed method gives high interpretability to humans while accurately grasping the attributes of the data.
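
The contrast the abstract draws, one common mask for all features versus a separate mask per feature, can be illustrated with a minimal sketch. The abstract does not say how the masks are computed or applied, so everything below is an assumption made for illustration: sigmoid-squashed logits as masks, element-wise gating of a C x H x W feature map, and the hypothetical function names `shared_attention`, `multi_channel_attention`, and `global_average_pool`. It is not the paper's architecture.

```python
import math


def sigmoid(x):
    """Squash a logit into (0, 1) so it can act as a soft mask value."""
    return 1.0 / (1.0 + math.exp(-x))


def shared_attention(features, logits):
    """Baseline: one H x W mask applied to every channel.

    features: C x H x W nested lists; logits: H x W nested lists.
    """
    mask = [[sigmoid(v) for v in row] for row in logits]
    return [
        [[f * m for f, m in zip(f_row, m_row)]
         for f_row, m_row in zip(channel, mask)]
        for channel in features
    ]


def multi_channel_attention(features, logits):
    """Assumed multi-channel variant: a separate H x W mask per channel.

    features and logits are both C x H x W nested lists.
    """
    if len(features) != len(logits):
        raise ValueError("need exactly one mask per feature channel")
    return [
        [[f * sigmoid(l) for f, l in zip(f_row, l_row)]
         for f_row, l_row in zip(channel, channel_logits)]
        for channel, channel_logits in zip(features, logits)
    ]


def global_average_pool(features):
    """Reduce each H x W channel to a single scalar by averaging."""
    return [
        sum(sum(row) for row in channel) / (len(channel) * len(channel[0]))
        for channel in features
    ]


# Toy input: two channels of a 2 x 2 feature map, all activations equal to 1.
features = [[[1.0, 1.0], [1.0, 1.0]],
            [[1.0, 1.0], [1.0, 1.0]]]

# Channel 0 attends to the left column, channel 1 to the right column.
per_channel_logits = [[[10.0, -10.0], [10.0, -10.0]],
                      [[-10.0, 10.0], [-10.0, 10.0]]]

attended = multi_channel_attention(features, per_channel_logits)
print(attended)
print(global_average_pool(attended))
```

With a shared mask, both channels would be forced to look at the same spatial region. With per-channel masks, each feature can be inspected on its own, which is the property the abstract credits for the added interpretability.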

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.
