LGAIATMay 31, 2021

Persistent Homology Captures the Generalization of Neural Networks Without A Validation Set

arXiv:2106.00012v14 citations
Originality Incremental advance
AI Analysis

This work addresses the need for efficient generalization monitoring in neural network training, though it appears incremental as it builds on existing topological methods.

The authors tackled the problem of estimating neural network generalization without a validation set by using Persistent Homology to analyze training dynamics, finding that PH diagram distance correlates with validation accuracy, implying intrinsic estimation is possible.

The training of neural networks is usually monitored with a validation (holdout) set to estimate the generalization of the model. This is done instead of measuring intrinsic properties of the model to determine whether it is learning appropriately. In this work, we suggest studying the training of neural networks with Algebraic Topology, specifically Persistent Homology (PH). Using simplicial complex representations of neural networks, we study the PH diagram distance evolution on the neural network learning process with different architectures and several datasets. Results show that the PH diagram distance between consecutive neural network states correlates with the validation accuracy, implying that the generalization error of a neural network could be intrinsically estimated without any holdout set.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes