CLAug 28, 2018

Privacy-preserving Neural Representations of Text

arXiv:1808.09408v11142 citations
Originality Incremental advance
AI Analysis

This addresses privacy risks in distributed NLP systems, such as cloud-based models, but is incremental in proposing specific defenses for a known attack scenario.

The paper tackles the problem of adversarial attacks on neural text classifiers where attackers eavesdrop on hidden representations to recover private input information, and proposes defense methods that improve privacy while characterizing the tradeoff with utility.

This article deals with adversarial attacks towards deep learning systems for Natural Language Processing (NLP), in the context of privacy protection. We study a specific type of attack: an attacker eavesdrops on the hidden representations of a neural text classifier and tries to recover information about the input text. Such scenario may arise in situations when the computation of a neural network is shared across multiple devices, e.g. some hidden representation is computed by a user's device and sent to a cloud-based model. We measure the privacy of a hidden representation by the ability of an attacker to predict accurately specific private information from it and characterize the tradeoff between the privacy and the utility of neural representations. Finally, we propose several defense methods based on modified training objectives and show that they improve the privacy of neural representations.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes