LGDCMLJun 24, 2020

ByGARS: Byzantine SGD with Arbitrary Number of Attackers

arXiv:2006.13421v213 citations
Originality Incremental advance
AI Analysis

This addresses the challenge of securing distributed learning against arbitrary adversarial attacks, which is incremental as it builds on existing Byzantine robustness methods.

The paper tackles the problem of distributed machine learning with any number of Byzantine adversaries by proposing two SGD algorithms, ByGARS and ByGARS++, which use reputation scores from an auxiliary dataset for gradient aggregation, showing convergence for strongly convex functions and effectiveness on MNIST and CIFAR-10 against state-of-the-art attacks.

We propose two novel stochastic gradient descent algorithms, ByGARS and ByGARS++, for distributed machine learning in the presence of any number of Byzantine adversaries. In these algorithms, reputation scores of workers are computed using an auxiliary dataset at the server. This reputation score is then used for aggregating the gradients for stochastic gradient descent. The computational complexity of ByGARS++ is the same as the usual distributed stochastic gradient descent method with only an additional inner product computation in every iteration. We show that using these reputation scores for gradient aggregation is robust to any number of multiplicative noise Byzantine adversaries and use two-timescale stochastic approximation theory to prove convergence for strongly convex loss functions. We demonstrate the effectiveness of the algorithms for non-convex learning problems using MNIST and CIFAR-10 datasets against almost all state-of-the-art Byzantine attacks. We also show that the proposed algorithms are robust to multiple different types of attacks at the same time.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes