LGAGCOOct 22, 2024

On Functional Dimension and Persistent Pseudodimension

arXiv:2410.17191v21 citationsh-index: 11
Originality Synthesis-oriented
AI Analysis

This work addresses a theoretical issue in machine learning for researchers studying neural network generalization, but it appears incremental as it builds on existing measures without presenting new empirical results.

The paper tackles the problem of understanding redundancy in ReLU neural network parameter spaces by analyzing local complexity measures, specifically functional dimension and persistent pseudodimension, to potentially explain generalization gaps and the double descent phenomenon.

For any fixed feedforward ReLU neural network architecture, it is well-known that many different parameter settings can determine the same function. It is less well-known that the degree of this redundancy is inhomogeneous across parameter space. In this work, we discuss two locally applicable complexity measures for ReLU network classes and what we know about the relationship between them: (1) the local functional dimension [14, 18], and (2) a local version of VC dimension that we call persistent pseudodimension. The former is easy to compute on finite batches of points; the latter should give local bounds on the generalization gap, which would inform an understanding of the mechanics of the double descent phenomenon [7].

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes