IR, CR · Jul 21, 2014

Privacy-Preserving Important Passage Retrieval

arXiv:1407.5416v1 · 3 citations
Originality: Incremental advance
AI Analysis

This work addresses privacy concerns for users or organizations that need secure document retrieval, but it is incremental: it adapts existing retrieval methods by adding privacy features.

The paper tackles the problem of retrieving important passages from documents while preserving privacy, achieving results similar to non-private methods on both clean and noisy text.

State-of-the-art important passage retrieval methods obtain very good results, but do not take into account privacy issues. In this paper, we present a privacy-preserving method that relies on creating secure representations of documents. Our approach allows third parties to retrieve important passages from documents without learning anything regarding their content. We use a hashing scheme known as Secure Binary Embeddings to convert a key phrase and bag-of-words representation to bit strings in a way that allows the computation of approximate distances, instead of exact ones. Experiments show that our secure system yields similar results to its non-private counterpart on both clean text and noisy speech-recognized text.
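
To illustrate the kind of hashing the abstract refers to, here is a minimal sketch of a Secure Binary Embedding in the commonly described form: a random Gaussian projection, a random dither, and a quantizer that alternates between 0 and 1 on intervals of width delta. The abstract gives no parameters, so the function names (`make_sbe_key`, `sbe_hash`, `normalized_hamming`), the dimensions, and the value of `delta` are all assumptions chosen for illustration, not the authors' configuration.

```python
import math
import random


def make_sbe_key(dim, n_bits, delta, seed=0):
    """Draw the secret key: a Gaussian projection matrix A and a dither w.

    Hypothetical helper; the key must stay with the data owner.
    """
    rng = random.Random(seed)
    A = [[rng.gauss(0.0, 1.0) for _ in range(dim)] for _ in range(n_bits)]
    w = [rng.uniform(0.0, delta) for _ in range(n_bits)]
    return A, w, delta


def sbe_hash(x, key):
    """Map a real vector x to a bit list: bit = floor((a.x + w) / delta) mod 2."""
    A, w, delta = key
    bits = []
    for row, dither in zip(A, w):
        projection = sum(a * xi for a, xi in zip(row, x))
        bits.append(math.floor((projection + dither) / delta) % 2)
    return bits


def normalized_hamming(h1, h2):
    """Fraction of bit positions in which the two hashes differ."""
    return sum(b1 != b2 for b1, b2 in zip(h1, h2)) / len(h1)


if __name__ == "__main__":
    key = make_sbe_key(dim=20, n_bits=1000, delta=4.0, seed=1)
    x = [0.5] * 20
    near = [x[0] + 0.1] + x[1:]   # Euclidean distance 0.1 from x
    far = [x[0] + 20.0] + x[1:]   # Euclidean distance 20 from x
    hx = sbe_hash(x, key)
    print("near:", normalized_hamming(hx, sbe_hash(near, key)))
    print("far: ", normalized_hamming(hx, sbe_hash(far, key)))
```

Under these assumptions, the Hamming distance between two hashes grows with the Euclidean distance between nearby vectors and levels off near 0.5 for distant ones. This is why a third party holding only the bit strings can compare close items approximately while learning little about items that are far apart.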
