CRLGMar 5, 2025

Data Poisoning Attacks to Locally Differentially Private Range Query Protocols

arXiv:2503.03454v22 citationsh-index: 6
Originality Highly original
AI Analysis

This work addresses a security problem for decentralized data collection systems, revealing a critical vulnerability in privacy-preserving protocols, but it is incremental as it builds on known attack types in LDP.

The paper tackles the vulnerability of locally differentially private range query protocols to data poisoning attacks, showing that malicious users can manipulate aggregated results by crafting fake data, with attacks amplifying estimations by 5-10 times more than normal users.

Local Differential Privacy (LDP) has been widely adopted to protect user privacy in decentralized data collection. However, recent studies have revealed that LDP protocols are vulnerable to data poisoning attacks, where malicious users manipulate their reported data to distort aggregated results. In this work, we present the first study on data poisoning attacks targeting LDP range query protocols, focusing on both tree-based and grid-based approaches. We identify three key challenges in executing such attacks, including crafting consistent and effective fake data, maintaining data consistency across levels or grids, and preventing server detection. To address the first two challenges, we propose novel attack methods that are provably optimal, including a tree-based attack and a grid-based attack, designed to manipulate range query results with high effectiveness. \textbf{Our key finding is that the common post-processing procedure, Norm-Sub, in LDP range query protocols can help the attacker massively amplify their attack effectiveness.} In addition, we study a potential countermeasure, but also propose an adaptive attack capable of evading this defense to address the third challenge. We evaluate our methods through theoretical analysis and extensive experiments on synthetic and real-world datasets. Our results show that the proposed attacks can significantly amplify estimations for arbitrary range queries by manipulating a small fraction of users, providing 5-10x more influence than a normal user to the estimation.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes