LGAISep 24, 2024

A Historical Trajectory Assisted Optimization Method for Zeroth-Order Federated Learning

arXiv:2409.15955v5h-index: 2
Originality Incremental advance
AI Analysis

This work addresses inefficiencies in federated learning for scenarios where gradient information is unavailable, offering an incremental improvement over existing zeroth-order optimization algorithms.

The paper tackles the problem of high gradient estimation errors in zeroth-order federated learning by proposing a non-isotropic sampling method based on historical trajectories, showing convergence rates comparable to existing methods without significant overheads.

Federated learning heavily relies on distributed gradient descent techniques. In the situation where gradient information is not available, the gradients need to be estimated from zeroth-order information, which typically involves computing finite-differences along isotropic random directions. This method suffers from high estimation errors, as the geometric features of the objective landscape may be overlooked during the isotropic sampling. In this work, we propose a non-isotropic sampling method to improve the gradient estimation procedure. Gradients in our method are estimated in a subspace spanned by historical trajectories of solutions, aiming to encourage the exploration of promising regions and hence improve the convergence. The proposed method uses a covariance matrix for sampling which is a convex combination of two parts. The first part is a thin projection matrix containing the basis of the subspace which is designed to improve the exploitation ability. The second part is the historical trajectories. We implement this method in zeroth-order federated settings, and show that the convergence rate aligns with existing ones while introducing no significant overheads in communication or local computation. The effectiveness of our proposal is verified on several numerical experiments in comparison to several commonly-used zeroth-order federated optimization algorithms.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes