CV LG PFSep 11, 2018

Zoom: SSD-based Vector Search for Optimizing Accuracy, Latency and Memory

arXiv:1809.04067v15 citations

Originality Highly original

AI Analysis

This addresses the need for efficient and accurate vector search in information retrieval systems, representing a novel method rather than incremental.

The paper tackles the problem of vector search needing high accuracy, low latency, and low memory, and develops Zoom, which achieves order-of-magnitude efficiency improvements while matching or exceeding state-of-the-art accuracy.

With the advancement of machine learning and deep learning, vector search becomes instrumental to many information retrieval systems, to search and find best matches to user queries based on their semantic similarities.These online services require the search architecture to be both effective with high accuracy and efficient with low latency and memory footprint, which existing work fails to offer. We develop, Zoom, a new vector search solution that collaboratively optimizes accuracy, latency and memory based on a multiview approach. (1) A "preview" step generates a small set of good candidates, leveraging compressed vectors in memory for reduced footprint and fast lookup. (2) A "fullview" step on SSDs reranks those candidates with their full-length vector, striking high accuracy. Our evaluation shows that, Zoom achieves an order of magnitude improvements on efficiency while attaining equal or higher accuracy, comparing with the state-of-the-art.

View on arXiv PDF

Similar