OCLGOct 12, 2025

Second-order Optimization under Heavy-Tailed Noise: Hessian Clipping and Sample Complexity Limits

arXiv:2510.10690v13 citationsh-index: 11
Originality Highly original
AI Analysis

This provides the first comprehensive theoretical framework for robust second-order optimization in heavy-tailed regimes, which is critical for applications with data heterogeneity and outliers.

The paper tackles the problem of second-order optimization under heavy-tailed noise, establishing tight lower bounds on sample complexity and developing a novel Hessian clipping algorithm that nearly matches these fundamental limits.

Heavy-tailed noise is pervasive in modern machine learning applications, arising from data heterogeneity, outliers, and non-stationary stochastic environments. While second-order methods can significantly accelerate convergence in light-tailed or bounded-noise settings, such algorithms are often brittle and lack guarantees under heavy-tailed noise -- precisely the regimes where robustness is most critical. In this work, we take a first step toward a theoretical understanding of second-order optimization under heavy-tailed noise. We consider a setting where stochastic gradients and Hessians have only bounded $p$-th moments, for some $p\in (1,2]$, and establish tight lower bounds on the sample complexity of any second-order method. We then develop a variant of normalized stochastic gradient descent that leverages second-order information and provably matches these lower bounds. To address the instability caused by large deviations, we introduce a novel algorithm based on gradient and Hessian clipping, and prove high-probability upper bounds that nearly match the fundamental limits. Our results provide the first comprehensive sample complexity characterization for second-order optimization under heavy-tailed noise. This positions Hessian clipping as a robust and theoretically sound strategy for second-order algorithm design in heavy-tailed regimes.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes