CRLGFeb 21, 2022

Degree-Preserving Randomized Response for Graph Neural Networks under Local Differential Privacy

arXiv:2202.10209v616 citations
Originality Highly original
AI Analysis

This addresses privacy concerns for users in social networks by enabling strong edge protection in GNNs without a trusted third party, representing a novel solution for an open problem in unattributed graphs.

The paper tackles the problem of protecting edge privacy in unattributed graphs for Graph Neural Networks (GNNs) under Local Differential Privacy (LDP), proposing the DPRR algorithm that preserves node degrees and graph structure. The result shows that DPRR significantly outperforms baselines and achieves accuracy close to non-private algorithms, e.g., with epsilon=1, on three social graph datasets.

Differentially private GNNs (Graph Neural Networks) have been recently studied to provide high accuracy in various tasks on graph data while strongly protecting user privacy. In particular, a recent study proposes an algorithm to protect each user's feature vector in an attributed graph, which includes feature vectors along with node IDs and edges, with LDP (Local Differential Privacy), a strong privacy notion without a trusted third party. However, this algorithm does not protect edges (friendships) in a social graph, hence cannot protect user privacy in unattributed graphs, which include only node IDs and edges. How to provide strong privacy with high accuracy in unattributed graphs remains open. In this paper, we propose a novel LDP algorithm called the DPRR (Degree-Preserving Randomized Response) to provide LDP for edges in GNNs. Our DPRR preserves each user's degree hence a graph structure while providing edge LDP. Technically, our DPRR uses Warner's RR (Randomized Response) and strategic edge sampling, where each user's sampling probability is automatically tuned using the Laplacian mechanism to preserve the degree information under edge LDP. We also propose a privacy budget allocation method to make the noise in both Warner's RR and the Laplacian mechanism small. We focus on graph classification as a task of GNNs and evaluate the DPRR using three social graph datasets. Our experimental results show that the DPRR significantly outperforms three baselines and provides accuracy close to a non-private algorithm in all datasets with a reasonable privacy budget, e.g., epsilon=1. Finally, we introduce data poisoning attacks to our DPRR and a defense against the attacks. We evaluate them using the three social graph datasets and discuss the experimental results.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes