LGAIDec 13, 2020

Demystifying Deep Neural Networks Through Interpretation: A Survey

arXiv:2012.07119v21 citations
AI Analysis

This survey addresses the problem of understanding and debugging deep neural networks for researchers and practitioners, which is an incremental contribution to the field.

This paper surveys existing works on interpreting deep neural networks, addressing the problem that optimizing a single objective metric is an incomplete description of real-world tasks and hinders understanding and fixing errors. The survey aims to provide insights into neural network behavior and thought processes.

Modern deep learning algorithms tend to optimize an objective metric, such as minimize a cross entropy loss on a training dataset, to be able to learn. The problem is that the single metric is an incomplete description of the real world tasks. The single metric cannot explain why the algorithm learn. When an erroneous happens, the lack of interpretability causes a hardness of understanding and fixing the error. Recently, there are works done to tackle the problem of interpretability to provide insights into neural networks behavior and thought process. The works are important to identify potential bias and to ensure algorithm fairness as well as expected performance.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes