LG OCMay 15, 2025

Asynchronous Decentralized SGD under Non-Convexity: A Block-Coordinate Descent Framework

arXiv:2505.10322v14.12 citationsh-index: 2

Originality Incremental advance

AI Analysis

This work addresses scalability and privacy challenges in decentralized learning for distributed systems, though it appears incremental as it builds on existing SGD frameworks with practical refinements.

The paper tackled the problem of asynchronous decentralized optimization under heterogeneous computation speeds and communication delays by introducing a refined Asynchronous Decentralized Stochastic Gradient Descent (ADSGD) model, showing it converges with computation-delay-independent step sizes and outperforms existing methods in wall-clock convergence time across various scenarios.

Decentralized optimization has become vital for leveraging distributed data without central control, enhancing scalability and privacy. However, practical deployments face fundamental challenges due to heterogeneous computation speeds and unpredictable communication delays. This paper introduces a refined model of Asynchronous Decentralized Stochastic Gradient Descent (ADSGD) under practical assumptions of bounded computation and communication times. To understand the convergence of ADSGD, we first analyze Asynchronous Stochastic Block Coordinate Descent (ASBCD) as a tool, and then show that ADSGD converges under computation-delay-independent step sizes. The convergence result is established without assuming bounded data heterogeneity. Empirical experiments reveal that ADSGD outperforms existing methods in wall-clock convergence time across various scenarios. With its simplicity, efficiency in memory and communication, and resilience to communication and computation delays, ADSGD is well-suited for real-world decentralized learning tasks.

View on arXiv PDF

Similar