LGDCMar 3, 2025

GRAIN: Exact Graph Reconstruction from Gradients

arXiv:2503.01838v16 citationsh-index: 64Has CodeICLR
Originality Highly original
AI Analysis

This work addresses a critical privacy vulnerability for users of federated learning systems handling graph data, such as in molecular or social network applications, by revealing a new attack vector that is incremental to prior work on text and image data.

The paper tackles the problem of privacy risks in federated learning by demonstrating that gradient inversion attacks can exactly reconstruct graph-structured data, including both structure and node features, from shared gradients, achieving up to 80% exact reconstruction on molecular, citation, and social network datasets.

Federated learning claims to enable collaborative model training among multiple clients with data privacy by transmitting gradient updates instead of the actual client data. However, recent studies have shown the client privacy is still at risk due to the, so called, gradient inversion attacks which can precisely reconstruct clients' text and image data from the shared gradient updates. While these attacks demonstrate severe privacy risks for certain domains and architectures, the vulnerability of other commonly-used data types, such as graph-structured data, remain under-explored. To bridge this gap, we present GRAIN, the first exact gradient inversion attack on graph data in the honest-but-curious setting that recovers both the structure of the graph and the associated node features. Concretely, we focus on Graph Convolutional Networks (GCN) and Graph Attention Networks (GAT) -- two of the most widely used frameworks for learning on graphs. Our method first utilizes the low-rank structure of GNN gradients to efficiently reconstruct and filter the client subgraphs which are then joined to complete the input graph. We evaluate our approach on molecular, citation, and social network datasets using our novel metric. We show that GRAIN reconstructs up to 80% of all graphs exactly, significantly outperforming the baseline, which achieves up to 20% correctly positioned nodes.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes