LGAISIMay 28, 2022

Large-Scale Privacy-Preserving Network Embedding against Private Link Inference Attacks

arXiv:2205.14440v14 citationsh-index: 14
Originality Incremental advance
AI Analysis

This addresses privacy risks in network analysis for applications like social networks, but it is incremental as it builds on existing embedding methods.

The paper tackles the problem of network embeddings leaking private link information by proposing a perturbation-based framework (PPNE) that modifies networks to protect privacy while maintaining utility for downstream tasks, achieving higher privacy protection with less utility loss compared to baselines on datasets with up to millions of nodes.

Network embedding represents network nodes by a low-dimensional informative vector. While it is generally effective for various downstream tasks, it may leak some private information of networks, such as hidden private links. In this work, we address a novel problem of privacy-preserving network embedding against private link inference attacks. Basically, we propose to perturb the original network by adding or removing links, and expect the embedding generated on the perturbed network can leak little information about private links but hold high utility for various downstream tasks. Towards this goal, we first propose general measurements to quantify privacy gain and utility loss incurred by candidate network perturbations; we then design a PPNE framework to identify the optimal perturbation solution with the best privacy-utility trade-off in an iterative way. Furthermore, we propose many techniques to accelerate PPNE and ensure its scalability. For instance, as the skip-gram embedding methods including DeepWalk and LINE can be seen as matrix factorization with closed form embedding results, we devise efficient privacy gain and utility loss approximation methods to avoid the repetitive time-consuming embedding training for every candidate network perturbation in each iteration. Experiments on real-life network datasets (with up to millions of nodes) verify that PPNE outperforms baselines by sacrificing less utility and obtaining higher privacy protection.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes