CVFeb 2, 2018

Visual Interpretability for Deep Learning: a Survey

arXiv:1802.00614v237.5912 citations
Originality Synthesis-oriented
AI Analysis

It addresses the interpretability problem in deep learning for researchers and practitioners, but it is incremental as it synthesizes existing studies rather than introducing new methods.

This survey reviews methods for improving the interpretability of deep neural networks, particularly convolutional neural networks, to address their black-box nature and potentially enhance learning from few annotations and human-computer interactions.

This paper reviews recent studies in understanding neural-network representations and learning neural networks with interpretable/disentangled middle-layer representations. Although deep neural networks have exhibited superior performance in various tasks, the interpretability is always the Achilles' heel of deep neural networks. At present, deep neural networks obtain high discrimination power at the cost of low interpretability of their black-box representations. We believe that high model interpretability may help people to break several bottlenecks of deep learning, e.g., learning from very few annotations, learning via human-computer communications at the semantic level, and semantically debugging network representations. We focus on convolutional neural networks (CNNs), and we revisit the visualization of CNN representations, methods of diagnosing representations of pre-trained CNNs, approaches for disentangling pre-trained CNN representations, learning of CNNs with disentangled representations, and middle-to-end learning based on model interpretability. Finally, we discuss prospective trends in explainable artificial intelligence.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes