LG SYSep 23, 2024

Peer-to-Peer Learning Dynamics of Wide Neural Networks

Shreyas Chaudhari, Srinivasa Pranav, Emile Anand, José M. F. Moura

arXiv:2409.15267v312.58 citationsh-index: 4

Originality Synthesis-oriented

AI Analysis

This work addresses the need for tuning neural networks in distributed edge settings like smart cities, offering insights for privacy-preserving collaborative training, though it is incremental as it builds on existing NTK and distributed learning theories.

The paper tackles the challenge of characterizing training dynamics for wide neural networks in peer-to-peer learning environments, leveraging neural tangent kernel theory and distributed consensus to provide explicit analytical predictions validated on classification tasks.

Peer-to-peer learning is an increasingly popular framework that enables beyond-5G distributed edge devices to collaboratively train deep neural networks in a privacy-preserving manner without the aid of a central server. Neural network training algorithms for emerging environments, e.g., smart cities, have many design considerations that are difficult to tune in deployment settings -- such as neural network architectures and hyperparameters. This presents a critical need for characterizing the training dynamics of distributed optimization algorithms used to train highly nonconvex neural networks in peer-to-peer learning environments. In this work, we provide an explicit characterization of the learning dynamics of wide neural networks trained using popular distributed gradient descent (DGD) algorithms. Our results leverage both recent advancements in neural tangent kernel (NTK) theory and extensive previous work on distributed learning and consensus. We validate our analytical results by accurately predicting the parameter and error dynamics of wide neural networks trained for classification tasks.

View on arXiv PDF

Similar