LGMLJun 3, 2023

On Size-Independent Sample Complexity of ReLU Networks

arXiv:2306.01992v35 citationsh-index: 21
Originality Incremental advance
AI Analysis

This addresses theoretical generalization guarantees for neural networks, but it is incremental as it builds on previous bounds.

The paper tackles the problem of sample complexity for learning ReLU neural networks by refining existing bounds to often eliminate explicit depth-dependence, improving upon prior work that included a square-root depth factor.

We study the sample complexity of learning ReLU neural networks from the point of view of generalization. Given norm constraints on the weight matrices, a common approach is to estimate the Rademacher complexity of the associated function class. Previously Golowich-Rakhlin-Shamir (2020) obtained a bound independent of the network size (scaling with a product of Frobenius norms) except for a factor of the square-root depth. We give a refinement which often has no explicit depth-dependence at all.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes