ML, LG · Apr 3

Characterization of Gaussian Universality Breakdown in High-Dimensional Empirical Risk Minimization

Chiheb Yaakoubi, Cosme Louart, Malik Tiomoko, Zhenyu Liao

arXiv:2604.03146

AI Analysis

This paper provides theoretical insight into the limits of Gaussian approximations in high-dimensional statistics. The contribution is incremental, but it clarifies foundational assumptions in machine learning.

The paper characterizes when Gaussian universality breaks down in high-dimensional empirical risk minimization under non-Gaussian data designs. It shows that the projection of the estimator onto an independent test covariate approximately follows a convolution of a (generally non-Gaussian) distribution with a Gaussian one.

We study high-dimensional convex empirical risk minimization (ERM) under general non-Gaussian data designs. By heuristically extending the Convex Gaussian Min-Max Theorem (CGMT) to non-Gaussian settings, we derive an asymptotic min-max characterization of key statistics, enabling approximation of the mean $\mu_{\hat\theta}$ and covariance $C_{\hat\theta}$ of the ERM estimator $\hat\theta$. Specifically, under a concentration assumption on the data matrix and standard regularity conditions on the loss and regularizer, we show that for a test covariate $x$ independent of the training data, the projection $\hat\theta^\top x$ approximately follows the convolution of the (generally non-Gaussian) distribution of $\mu_{\hat\theta}^\top x$ with an independent centered Gaussian variable of variance $\text{Tr}(C_{\hat\theta}\mathbb{E}[xx^\top])$. This result clarifies the scope and limits of Gaussian universality for ERMs. Additionally, we prove that any $\mathcal{C}^2$ regularizer is asymptotically equivalent to a quadratic form determined solely by its Hessian at zero and gradient at $\mu_{\hat\theta}$. Numerical simulations across diverse losses and models are provided to validate our theoretical predictions and qualitative insights.
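The convolution structure described in the abstract can be illustrated with a minimal NumPy sketch. It is not the paper's method: the mean vector `mu`, the covariance `C`, the Rademacher design, and the helper `sample_projection_law` are all hypothetical choices made for illustration, whereas the paper obtains the mean and covariance from its min-max characterization. The sketch only draws from the stated approximate law, a non-Gaussian part $\mu^\top x$ plus an independent centered Gaussian of variance $\text{Tr}(C\,\mathbb{E}[xx^\top])$.

```python
import numpy as np


def sample_projection_law(mu, C, x_samples, rng):
    """Draw from the law of mu^T x convolved with N(0, Tr(C Sigma)).

    Sigma = E[x x^T] is estimated from x_samples (one covariate per row).
    """
    n = x_samples.shape[0]
    sigma = x_samples.T @ x_samples / n          # empirical second moment E[x x^T]
    noise_var = float(np.trace(C @ sigma))       # variance of the Gaussian component
    mean_part = x_samples @ mu                   # generally non-Gaussian component
    gauss_part = rng.normal(0.0, np.sqrt(noise_var), size=n)
    return mean_part + gauss_part, noise_var


# Hypothetical setup (assumption): Rademacher covariates and a sparse mean
# vector, so that mu^T x takes the values +1 or -1 and is far from Gaussian.
rng = np.random.default_rng(0)
d, n = 50, 20000
x_samples = rng.choice([-1.0, 1.0], size=(n, d))
mu = np.zeros(d)
mu[0] = 1.0
C = (0.04 / d) * np.eye(d)                       # assumed estimator covariance

samples, noise_var = sample_projection_law(mu, C, x_samples, rng)

# A Gaussian has kurtosis 3; a clearly smaller value signals non-Gaussianity.
centered = samples - samples.mean()
kurtosis = np.mean(centered**4) / np.mean(centered**2) ** 2
print(f"Gaussian variance: {noise_var:.4f}, kurtosis of projection: {kurtosis:.3f}")
```

In this toy setup the projection is bimodal because the non-Gaussian component dominates. If `mu` is instead spread evenly over many coordinates, $\mu^\top x$ is itself close to Gaussian and the whole projection looks Gaussian, which is the regime where universality would be expected to hold.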
