DIS-NNLGMLFeb 7, 2025

Two-Point Deterministic Equivalence for Stochastic Gradient Dynamics in Linear Models

Harvard
arXiv:2502.05074v36 citationsh-index: 28
Originality Highly original
AI Analysis

This work provides a unified understanding of stochastic gradient descent performance for high-dimensional linear models, which is significant for machine learning researchers and practitioners working with these models.

The authors derived a novel deterministic equivalence for stochastic gradient dynamics in linear models, unifying the performance analysis of various high-dimensional linear models. This includes previously known and novel asymptotics for models such as linear regression, kernel regression, and linear random feature models.

We derive a novel deterministic equivalence for the two-point function of a random matrix resolvent. Using this result, we give a unified derivation of the performance of a wide variety of high-dimensional linear models trained with stochastic gradient descent. This includes high-dimensional linear regression, kernel regression, and linear random feature models. Our results include previously known asymptotics as well as novel ones.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes