ML LG OC PRMar 25, 2025

A stochastic gradient descent algorithm with random search directions

arXiv:2503.19942v2h-index: 1Trans. Mach. Learn. Res.

Originality Incremental advance

AI Analysis

This work addresses a methodological bottleneck in optimization algorithms for machine learning, offering a more flexible approach but is incremental in extending existing methods.

The paper tackles the limitation of stochastic coordinate descent being restricted to canonical basis vectors by developing a new class of stochastic gradient descent algorithms with random search directions, establishing almost sure convergence, analyzing central limit theorems, and providing non-asymptotic convergence rates.

Stochastic coordinate descent algorithms are efficient methods in which each iterate is obtained by fixing most coordinates at their values from the current iteration, and approximately minimizing the objective with respect to the remaining coordinates. However, this approach is usually restricted to canonical basis vectors of $\mathbb{R}^d$. In this paper, we develop a new class of stochastic gradient descent algorithms with random search directions which uses the directional derivative of the gradient estimate following more general random vectors. We establish the almost sure convergence of these algorithms with decreasing step. We further investigate their central limit theorem and pay particular attention to analyze the impact of the search distributions on the asymptotic covariance matrix. We also provide non-asymptotic $\mathbb{L}^p$ rates of convergence.

View on arXiv PDF

Similar