NELGMLApr 30, 2015

Deep Neural Networks with Random Gaussian Weights: A Universal Classification Strategy?

arXiv:1504.08291v5190 citations
Originality Incremental advance
AI Analysis

This work provides theoretical insights into deep learning for researchers, though it is incremental as it builds on existing compressed sensing and dictionary learning tools.

The authors tackled the problem of understanding fundamental classification properties in deep neural networks by proving that networks with random Gaussian weights perform distance-preserving embeddings, with theoretical bounds on training set size and validation using state-of-the-art networks.

Three important properties of a classification machinery are: (i) the system preserves the core information of the input data; (ii) the training examples convey information about unseen data; and (iii) the system is able to treat differently points from different classes. In this work we show that these fundamental properties are satisfied by the architecture of deep neural networks. We formally prove that these networks with random Gaussian weights perform a distance-preserving embedding of the data, with a special treatment for in-class and out-of-class data. Similar points at the input of the network are likely to have a similar output. The theoretical analysis of deep networks here presented exploits tools used in the compressed sensing and dictionary learning literature, thereby making a formal connection between these important topics. The derived results allow drawing conclusions on the metric learning properties of the network and their relation to its structure, as well as providing bounds on the required size of the training set such that the training examples would represent faithfully the unseen data. The results are validated with state-of-the-art trained networks.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes