CVNov 29, 2017

Online Product Quantization

arXiv:1711.10775v250 citations
Originality Incremental advance
AI Analysis

This addresses the challenge of handling large-scale, dynamically changing databases for tasks like search and retrieval, though it is incremental as it builds on existing product quantization methods.

The paper tackles the problem of approximate nearest neighbor search in dynamic databases where data distribution evolves over time, by developing an online product quantization model that incrementally updates the quantization codebook to handle streaming data, and experiments show it is time-efficient and effective compared to baseline methods.

Approximate nearest neighbor (ANN) search has achieved great success in many tasks. However, existing popular methods for ANN search, such as hashing and quantization methods, are designed for static databases only. They cannot handle well the database with data distribution evolving dynamically, due to the high computational effort for retraining the model based on the new database. In this paper, we address the problem by developing an online product quantization (online PQ) model and incrementally updating the quantization codebook that accommodates to the incoming streaming data. Moreover, to further alleviate the issue of large scale computation for the online PQ update, we design two budget constraints for the model to update partial PQ codebook instead of all. We derive a loss bound which guarantees the performance of our online PQ model. Furthermore, we develop an online PQ model over a sliding window with both data insertion and deletion supported, to reflect the real-time behaviour of the data. The experiments demonstrate that our online PQ model is both time-efficient and effective for ANN search in dynamic large scale databases compared with baseline methods and the idea of partial PQ codebook update further reduces the update cost.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes