LGOCAug 19, 2024

Faster Adaptive Decentralized Learning Algorithms

arXiv:2408.09775v14 citationsh-index: 3
Originality Incremental advance
AI Analysis

This work addresses efficiency issues in decentralized machine learning for applications like neural network training, though it is incremental as it builds on existing adaptive methods.

The paper tackles the problem of high sample complexity in adaptive decentralized learning algorithms by proposing AdaMDOS and AdaMDOF, achieving near-optimal sample complexities of Õ(ε^{-3}) for stochastic optimization and O(√n ε^{-2}) for finite-sum optimization, respectively.

Decentralized learning recently has received increasing attention in machine learning due to its advantages in implementation simplicity and system robustness, data privacy. Meanwhile, the adaptive gradient methods show superior performances in many machine learning tasks such as training neural networks. Although some works focus on studying decentralized optimization algorithms with adaptive learning rates, these adaptive decentralized algorithms still suffer from high sample complexity. To fill these gaps, we propose a class of faster adaptive decentralized algorithms (i.e., AdaMDOS and AdaMDOF) for distributed nonconvex stochastic and finite-sum optimization, respectively. Moreover, we provide a solid convergence analysis framework for our methods. In particular, we prove that our AdaMDOS obtains a near-optimal sample complexity of $\tilde{O}(ε^{-3})$ for finding an $ε$-stationary solution of nonconvex stochastic optimization. Meanwhile, our AdaMDOF obtains a near-optimal sample complexity of $O(\sqrt{n}ε^{-2})$ for finding an $ε$-stationary solution of nonconvex finite-sum optimization, where $n$ denotes the sample size. To the best of our knowledge, our AdaMDOF algorithm is the first adaptive decentralized algorithm for nonconvex finite-sum optimization. Some experimental results demonstrate efficiency of our algorithms.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes