AILGMLOct 9, 2017

Function space analysis of deep learning representation layers

arXiv:1710.03263v14 citations
Originality Incremental advance
AI Analysis

This work addresses the need for better theoretical understanding of deep learning representations, which could help demystify aspects of the field, though it appears incremental as it builds on existing methods applied to other algorithms.

The paper tackles the problem of analyzing representation layers in deep learning by proposing a function space approach using a weak-type Besov smoothness index to quantify feature space geometry, and shows that this smoothness increases across layers in trained networks and decreases with added mis-labeling in training data.

In this paper we propose a function space approach to Representation Learning and the analysis of the representation layers in deep learning architectures. We show how to compute a weak-type Besov smoothness index that quantifies the geometry of the clustering in the feature space. This approach was already applied successfully to improve the performance of machine learning algorithms such as the Random Forest and tree-based Gradient Boosting. Our experiments demonstrate that in well-known and well-performing trained networks, the Besov smoothness of the training set, measured in the corresponding hidden layer feature map representation, increases from layer to layer. We also contribute to the understanding of generalization by showing how the Besov smoothness of the representations, decreases as we add more mis-labeling to the training data. We hope this approach will contribute to the de-mystification of some aspects of deep learning.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes