DIS-NN, LG, ML
Sep 28, 2018

Deep learning systems as complex networks

arXiv:1809.10941v1
31 citations
AI Analysis

This paper addresses the interpretability challenge in deep learning, but its contribution is incremental: it applies existing network analysis methods to one specific class of models.

The paper addresses the opaque internal dynamics of deep belief networks. It proposes studying them with complex network techniques in order to gain insight into their structural and functional properties after learning.

Thanks to the availability of large scale digital datasets and massive amounts of computational power, deep learning algorithms can learn representations of data by exploiting multiple levels of abstraction. These machine learning methods have greatly improved the state-of-the-art in many challenging cognitive tasks, such as visual object recognition, speech processing, natural language understanding and automatic translation. In particular, one class of deep learning models, known as deep belief networks, can discover intricate statistical structure in large data sets in a completely unsupervised fashion, by learning a generative model of the data using Hebbian-like learning mechanisms. Although these self-organizing systems can be conveniently formalized within the framework of statistical mechanics, their internal functioning remains opaque, because their emergent dynamics cannot be solved analytically. In this article we propose to study deep belief networks using techniques commonly employed in the study of complex networks, in order to gain some insights into the structural and functional properties of the computational graph resulting from the learning process.
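As a rough illustration of what treating a learned layer as a complex network can look like, the sketch below views a weight matrix between visible and hidden units as a weighted bipartite graph and computes node strength (the sum of absolute edge weights at each node), a standard complex-network measure. This is a minimal sketch under stated assumptions, not the authors' procedure: the function name `node_strengths` and the random `toy_weights` matrix are hypothetical stand-ins for a trained layer, and the abstract does not say which measures the paper actually uses.

```python
import random


def node_strengths(weights):
    """Return (visible_strengths, hidden_strengths) for a bipartite weight matrix.

    weights[i][j] is the connection weight between visible unit i and
    hidden unit j. Strength is taken here as the sum of the absolute
    weights on a node's edges (an assumption made for this illustration).
    """
    n_visible = len(weights)
    n_hidden = len(weights[0])
    # Each row holds all edges of one visible unit.
    visible = [sum(abs(w) for w in row) for row in weights]
    # Each column holds all edges of one hidden unit.
    hidden = [
        sum(abs(weights[i][j]) for i in range(n_visible))
        for j in range(n_hidden)
    ]
    return visible, hidden


# Hypothetical stand-in for a learned layer: 4 visible units, 3 hidden units.
rng = random.Random(0)
toy_weights = [[rng.gauss(0.0, 0.1) for _ in range(3)] for _ in range(4)]
visible_s, hidden_s = node_strengths(toy_weights)
```

With real trained weights in place of `toy_weights`, the distribution of these strengths across units is the kind of structural property that network analysis would examine.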

