ML AI LGJul 8, 2024

How DNNs break the Curse of Dimensionality: Compositionality and Symmetry Learning

Arthur Jacot, Seok Hoan Choi, Yuxiao Wen

arXiv:2407.05664v213.16 citationsh-index: 16Has Code

Originality Highly original

AI Analysis

This addresses a foundational problem in machine learning by theoretically explaining how DNNs achieve efficient high-dimensional learning, though it is incremental in refining existing theoretical frameworks.

The paper demonstrates that deep neural networks can efficiently learn compositions of functions with bounded F1-norm, enabling them to overcome the curse of dimensionality, unlike shallow networks, with empirical scaling laws showing phase transitions based on learning difficulty.

We show that deep neural networks (DNNs) can efficiently learn any composition of functions with bounded $F_{1}$-norm, which allows DNNs to break the curse of dimensionality in ways that shallow networks cannot. More specifically, we derive a generalization bound that combines a covering number argument for compositionality, and the $F_{1}$-norm (or the related Barron norm) for large width adaptivity. We show that the global minimizer of the regularized loss of DNNs can fit for example the composition of two functions $f^{*}=h\circ g$ from a small number of observations, assuming $g$ is smooth/regular and reduces the dimensionality (e.g. $g$ could be the quotient map of the symmetries of $f^{*}$), so that $h$ can be learned in spite of its low regularity. The measures of regularity we consider is the Sobolev norm with different levels of differentiability, which is well adapted to the $F_{1}$ norm. We compute scaling laws empirically and observe phase transitions depending on whether $g$ or $h$ is harder to learn, as predicted by our theory.

View on arXiv PDF Code

Similar