LGApr 15, 2024

Unsupervised Federated Optimization at the Edge: D2D-Enabled Learning without Labels

arXiv:2404.09861v16 citationsh-index: 28IEEE Trans Cogn Commun Netw
Originality Incremental advance
AI Analysis

This addresses the challenge of federated learning without labels for edge computing applications, though it is incremental by extending contrastive learning to federated settings.

The paper tackles the problem of federated learning with unlabeled data across edge devices by developing CF-CL, which uses device-to-device cooperation for data or embedding exchange. Results show it aligns latent spaces, speeds up training, and works well in non-i.i.d. settings.

Federated learning (FL) is a popular solution for distributed machine learning (ML). While FL has traditionally been studied for supervised ML tasks, in many applications, it is impractical to assume availability of labeled data across devices. To this end, we develop Cooperative Federated unsupervised Contrastive Learning ({\tt CF-CL)} to facilitate FL across edge devices with unlabeled datasets. {\tt CF-CL} employs local device cooperation where either explicit (i.e., raw data) or implicit (i.e., embeddings) information is exchanged through device-to-device (D2D) communications to improve local diversity. Specifically, we introduce a \textit{smart information push-pull} methodology for data/embedding exchange tailored to FL settings with either soft or strict data privacy restrictions. Information sharing is conducted through a probabilistic importance sampling technique at receivers leveraging a carefully crafted reserve dataset provided by transmitters. In the implicit case, embedding exchange is further integrated into the local ML training at the devices via a regularization term incorporated into the contrastive loss, augmented with a dynamic contrastive margin to adjust the volume of latent space explored. Numerical evaluations demonstrate that {\tt CF-CL} leads to alignment of latent spaces learned across devices, results in faster and more efficient global model training, and is effective in extreme non-i.i.d. data distribution settings across devices.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes