ML LGMay 18

Shallow ReLU$^s$ Networks in $L^p$-Type and Sobolev Spaces: Approximation and Path-Norm Controlled Generalization

arXiv:2605.184687.9

Predicted impact top 38% in ML · last 90 daysOriginality Incremental advance

AI Analysis

Provides theoretical guarantees for shallow ReLU^s networks in nonparametric regression, benefiting the machine learning theory community by extending approximation and generalization results to a broader class of function spaces.

This paper establishes approximation bounds for shallow ReLU^s networks in L^p-type and Sobolev spaces, improving random-feature rates, and proves minimax-optimal generalization bounds for path-norm-regularized networks with matching lower bounds.

We study approximation by shallow ReLU$^s$ networks, $σ_s(t)=\max{0,t}^s$, and the generalization behavior of such networks under $\ell_1$ path-norm control. For the $L^p$-type integral spaces $\widetilde{\mathcal{F}}_{p,τ_d,s}$, $1\le p\le2$, we establish approximation bounds for shallow networks using spherical harmonic analysis. In particular, when the parameter measure is the uniform measure $τ_d$ and $p<p^*=(2d+2)/(d+3)$, we obtain the rate $O(m^{-1/2-d(2-p)/(2d(2-p)+2p(2s+d+1))}\log^{3/2}m)$, which improves the corresponding random-feature rate. We also derive approximation rates for Sobolev spaces $W^{α,p}$ in the range $1\le p<2$ by embedding them into spectral Barron spaces. Finally, for nonparametric regression with sub-Gaussian noise, we prove minimax-optimal generalization bounds for path-norm-regularized shallow ReLU$^s$ networks over Barron and Sobolev spaces, with matching lower bounds up to logarithmic factors.

View on arXiv PDF

Similar