Periodic Spectral Ergodicity: A Complexity Measure for Deep Neural Networks and Neural Architecture Search

arXiv:1911.07831v42 citations
Originality Incremental advance
AI Analysis

This work addresses a fundamental problem in machine learning for researchers and practitioners by providing a tool to understand and optimize neural network architectures, though it appears incremental as it builds on existing concepts from quantum statistical mechanics.

The authors tackled the challenge of linking neural network structure to generalization by developing a new complexity measure called Periodic Spectral Ergodicity (PSE), which quantifies both topological and internal processing complexity from learned weights, and demonstrated its practical application on ResNet and VGG models.

Establishing associations between the structure and the generalisation ability of deep neural networks (DNNs) is a challenging task in modern machine learning. Producing solutions to this challenge will bring progress both in the theoretical understanding of DNNs and in building new architectures efficiently. In this work, we address this challenge by developing a new complexity measure based on the concept of {Periodic Spectral Ergodicity} (PSE) originating from quantum statistical mechanics. Based on this measure a technique is devised to quantify the complexity of deep neural networks from the learned weights and traversing the network connectivity in a sequential manner, hence the term cascading PSE (cPSE), as an empirical complexity measure. This measure will capture both topological and internal neural processing complexity simultaneously. Because of this cascading approach, i.e., a symmetric divergence of PSE on the consecutive layers, it is possible to use this measure for Neural Architecture Search (NAS). We demonstrate the usefulness of this measure in practice on two sets of vision models, ResNet and VGG, and sketch the computation of cPSE for more complex network structures.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes