LG MLNov 3, 2025

Estimation of Toeplitz Covariance Matrices using Overparameterized Gradient Descent

arXiv:2511.01605v14.1h-index: 32

Originality Incremental advance

AI Analysis

This addresses covariance estimation for statistical modeling, offering a simpler and scalable alternative to existing optimization methods.

The paper tackles covariance estimation under Toeplitz structure by applying overparameterized gradient descent, showing that mild overparameterization enables global convergence and matches or exceeds state-of-the-art methods in accuracy.

We consider covariance estimation under Toeplitz structure. Numerous sophisticated optimization methods have been developed to maximize the Gaussian log-likelihood under Toeplitz constraints. In contrast, recent advances in deep learning demonstrate the surprising power of simple gradient descent (GD) applied to overparameterized models. Motivated by this trend, we revisit Toeplitz covariance estimation through the lens of overparameterized GD. We model the $P\times P$ covariance as a sum of $K$ complex sinusoids with learnable parameters and optimize them via GD. We show that when $K = P$, GD may converge to suboptimal solutions. However, mild overparameterization ($K = 2P$ or $4P$) consistently enables global convergence from random initializations. We further propose an accelerated GD variant with separate learning rates for amplitudes and frequencies. When frequencies are fixed and only amplitudes are optimized, we prove that the optimization landscape is asymptotically benign and any stationary point recovers the true covariance. Finally, numerical experiments demonstrate that overparameterized GD can match or exceed the accuracy of state-of-the-art methods in challenging settings, while remaining simple and scalable.

View on arXiv PDF

Similar