LGJan 18, 2025

Distributed Quasi-Newton Method for Fair and Fast Federated Learning

arXiv:2501.10877v14 citationsh-index: 8Trans. Mach. Learn. Res.
Originality Incremental advance
AI Analysis

This addresses fairness and efficiency issues in federated learning for edge devices, though it is an incremental improvement by combining quasi-Newton methods with fairness constraints.

The paper tackles the problem of slow convergence and unfairness in federated learning by introducing DQN-Fed, a distributed quasi-Newton method that ensures fairness and accelerates training, achieving linear-quadratic convergence and outperforming state-of-the-art methods in fairness, accuracy, and speed.

Federated learning (FL) is a promising technology that enables edge devices/clients to collaboratively and iteratively train a machine learning model under the coordination of a central server. The most common approach to FL is first-order methods, where clients send their local gradients to the server in each iteration. However, these methods often suffer from slow convergence rates. As a remedy, second-order methods, such as quasi-Newton, can be employed in FL to accelerate its convergence. Unfortunately, similarly to the first-order FL methods, the application of second-order methods in FL can lead to unfair models, achieving high average accuracy while performing poorly on certain clients' local datasets. To tackle this issue, in this paper we introduce a novel second-order FL framework, dubbed \textbf{d}istributed \textbf{q}uasi-\textbf{N}ewton \textbf{fed}erated learning (DQN-Fed). This approach seeks to ensure fairness while leveraging the fast convergence properties of quasi-Newton methods in the FL context. Specifically, DQN-Fed helps the server update the global model in such a way that (i) all local loss functions decrease to promote fairness, and (ii) the rate of change in local loss functions aligns with that of the quasi-Newton method. We prove the convergence of DQN-Fed and demonstrate its \textit{linear-quadratic} convergence rate. Moreover, we validate the efficacy of DQN-Fed across a range of federated datasets, showing that it surpasses state-of-the-art fair FL methods in fairness, average accuracy and convergence speed.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes