LGAIMLFeb 24, 2020

Breaking Batch Normalization for better explainability of Deep Neural Networks through Layer-wise Relevance Propagation

arXiv:2002.11018v121 citations
AI Analysis

This work addresses the lack of transparency in neural networks for users needing explainability, though it is incremental as it extends an existing technique.

The paper tackled the problem of Layerwise Relevance Propagation (LRP) not handling normalization layers in deep neural networks, by proposing a method to fuse normalization layers with convolutional or fully connected layers, resulting in more accurate heatmaps on MNIST and CIFAR-10 datasets for convolutional layers.

The lack of transparency of neural networks stays a major break for their use. The Layerwise Relevance Propagation technique builds heat-maps representing the relevance of each input in the model s decision. The relevance spreads backward from the last to the first layer of the Deep Neural Network. Layer-wise Relevance Propagation does not manage normalization layers, in this work we suggest a method to include normalization layers. Specifically, we build an equivalent network fusing normalization layers and convolutional or fully connected layers. Heatmaps obtained with our method on MNIST and CIFAR 10 datasets are more accurate for convolutional layers. Our study also prevents from using Layerwise Relevance Propagation with networks including a combination of connected layers and normalization layer.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes