LGAug 23, 2025

Reconciling Communication Compression and Byzantine-Robustness in Distributed Learning

arXiv:2508.17129v21 citationsh-index: 7
Originality Incremental advance
AI Analysis

This addresses a critical problem for scalable distributed learning systems by reconciling efficiency and security, though it is incremental as it builds on existing methods.

The paper tackles the combined challenge of communication compression and Byzantine robustness in distributed learning, introducing RoSDHB, which matches theoretical convergence guarantees of prior work while demonstrating stronger robustness and substantial communication savings empirically.

Distributed learning enables scalable model training over decentralized data, but remains hindered by Byzantine faults and high communication costs. While both challenges have been studied extensively in isolation, their interplay has received limited attention. Prior work has shown that naively combining communication compression with Byzantine-robust aggregation can severely weaken resilience to faulty nodes. The current state-of-the-art, Byz-DASHA-PAGE, leverages a momentum-based variance reduction scheme to counteract the negative effect of compression noise on Byzantine robustness. In this work, we introduce RoSDHB, a new algorithm that integrates classical Polyak momentum with a coordinated compression strategy. Theoretically, RoSDHB matches the convergence guarantees of Byz-DASHA-PAGE under the standard $(G,B)$-gradient dissimilarity model, while relying on milder assumptions and requiring less memory and communication per client. Empirically, RoSDHB demonstrates stronger robustness while achieving substantial communication savings compared to Byz-DASHA-PAGE.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes