LGNov 18, 2024

Federated Contrastive Learning of Graph-Level Representations

arXiv:2411.12098v12 citationsh-index: 24BigData
Originality Incremental advance
AI Analysis

This addresses the need for federated learning in graph-level tasks like network traffic analysis and protein property prediction, where data privacy or size prevents central sharing, representing an incremental advancement in combining federated learning with graph representation learning.

The paper tackles the problem of learning graph-level representations in a federated setting where data cannot be centrally shared, proposing a framework called Federated Contrastive Learning of Graph-level Representations (FCLG) that applies contrastive learning at two levels to address local learning and data distribution variation, and demonstrates it outperforms baselines in graph-level clustering with significant margins.

Graph-level representations (and clustering/classification based on these representations) are required in a variety of applications. Examples include identifying malicious network traffic, prediction of protein properties, and many others. Often, data has to stay in isolated local systems (i.e., cannot be centrally shared for analysis) due to a variety of considerations like privacy concerns, lack of trust between the parties, regulations, or simply because the data is too large to be shared sufficiently quickly. This points to the need for federated learning for graph-level representations, a topic that has not been explored much, especially in an unsupervised setting. Addressing this problem, this paper presents a new framework we refer to as Federated Contrastive Learning of Graph-level Representations (FCLG). As the name suggests, our approach builds on contrastive learning. However, what is unique is that we apply contrastive learning at two levels. The first application is for local unsupervised learning of graph representations. The second level is to address the challenge associated with data distribution variation (i.e. the ``Non-IID issue") when combining local models. Through extensive experiments on the downstream task of graph-level clustering, we demonstrate FCLG outperforms baselines (which apply existing federated methods on existing graph-level clustering methods) with significant margins.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes