LG CR MLJun 22, 2025

Byzantine Failures Harm the Generalization of Robust Distributed Learning Algorithms More Than Data Poisoning

Thomas Boudou, Batiste Le Bars, Nirupam Gupta, Aurélien Bellet

arXiv:2506.18020v24.1h-index: 31

Originality Incremental advance

AI Analysis

This work addresses a fundamental gap in understanding generalization in distributed learning for security applications, providing theoretical insights that are incremental but important for designing more resilient algorithms.

The paper tackles the problem of how different threat models affect the generalization of robust distributed learning algorithms, showing that Byzantine failures lead to strictly worse generalization rates than data poisoning, with proven degradation factors of Θ(f/(n-f)) for data poisoning and Ω(√(f/(n-2f))) for Byzantine failures.

Robust distributed learning algorithms aim to maintain reliable performance despite the presence of misbehaving workers. Such misbehaviors are commonly modeled as Byzantine failures, allowing arbitrarily corrupted communication, or as data poisoning, a weaker form of corruption restricted to local training data. While prior work shows similar optimization guarantees for both models, an important question remains: How do these threat models impact generalization? Empirical evidence suggests a gap, yet it remains unclear whether it is unavoidable or merely an artifact of suboptimal attacks. We show, for the first time, a fundamental gap in generalization guarantees between the two threat models: Byzantine failures yield strictly worse rates than those achievable under data poisoning. Our findings leverage a tight algorithmic stability analysis of robust distributed learning. Specifically, we prove that: (i) under data poisoning, the uniform algorithmic stability of an algorithm with optimal optimization guarantees degrades by an additive factor of $\varTheta ( \frac{f}{n-f} )$, with $f$ out of $n$ workers misbehaving; whereas $\textit{(ii)}$ under Byzantine failures, the degradation is in $Ω\big( \sqrt{ \frac{f}{n-2f}} \big)$.

View on arXiv PDF

Similar