LGDCMAOCJan 30, 2024

Rendering Wireless Environments Useful for Gradient Estimators: A Zero-Order Stochastic Federated Learning Method

arXiv:2401.17460v32 citationsh-index: 32024 60th Annual Allerton Conference on Communication, Control, and Computing
Originality Highly original
AI Analysis

This addresses communication efficiency for federated learning applications on mobile devices, representing a novel integration of wireless channel properties into the learning algorithm.

The paper tackles the communication bottleneck in cross-device federated learning over wireless networks by proposing a zero-order method that replaces gradient vectors with scalar values and leverages the wireless channel directly, achieving convergence with a rate of O(1/∛K) in nonconvex settings.

Cross-device federated learning (FL) is a growing machine learning setting whereby multiple edge devices collaborate to train a model without disclosing their raw data. With the great number of mobile devices participating in more FL applications via the wireless environment, the practical implementation of these applications will be hindered due to the limited uplink capacity of devices, causing critical bottlenecks. In this work, we propose a novel doubly communication-efficient zero-order (ZO) method with a one-point gradient estimator that replaces communicating long vectors with scalar values and that harnesses the nature of the wireless communication channel, overcoming the need to know the channel state coefficient. It is the first method that includes the wireless channel in the learning algorithm itself instead of wasting resources to analyze it and remove its impact. We then offer a thorough analysis of the proposed zero-order federated learning (ZOFL) framework and prove that our method converges \textit{almost surely}, which is a novel result in nonconvex ZO optimization. We further prove a convergence rate of $O(\frac{1}{\sqrt[3]{K}})$ in the nonconvex setting. We finally demonstrate the potential of our algorithm with experimental results.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes