LGCRMLJun 22, 2025

Byzantine Failures Harm the Generalization of Robust Distributed Learning Algorithms More Than Data Poisoning

arXiv:2506.18020v2h-index: 31
Originality Incremental advance
AI Analysis

This work addresses a fundamental gap in understanding generalization in distributed learning for security applications, providing theoretical insights that are incremental but important for designing more resilient algorithms.

The paper tackles the problem of how different threat models affect the generalization of robust distributed learning algorithms, showing that Byzantine failures lead to strictly worse generalization rates than data poisoning, with proven degradation factors of Θ(f/(n-f)) for data poisoning and Ω(√(f/(n-2f))) for Byzantine failures.

Robust distributed learning algorithms aim to maintain reliable performance despite the presence of misbehaving workers. Such misbehaviors are commonly modeled as Byzantine failures, allowing arbitrarily corrupted communication, or as data poisoning, a weaker form of corruption restricted to local training data. While prior work shows similar optimization guarantees for both models, an important question remains: How do these threat models impact generalization? Empirical evidence suggests a gap, yet it remains unclear whether it is unavoidable or merely an artifact of suboptimal attacks. We show, for the first time, a fundamental gap in generalization guarantees between the two threat models: Byzantine failures yield strictly worse rates than those achievable under data poisoning. Our findings leverage a tight algorithmic stability analysis of robust distributed learning. Specifically, we prove that: (i) under data poisoning, the uniform algorithmic stability of an algorithm with optimal optimization guarantees degrades by an additive factor of $\varTheta ( \frac{f}{n-f} )$, with $f$ out of $n$ workers misbehaving; whereas $\textit{(ii)}$ under Byzantine failures, the degradation is in $Ω\big( \sqrt{ \frac{f}{n-2f}} \big)$.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes