LGAICVDBDSFeb 17, 2024

Probabilistic Routing for Graph-Based Approximate Nearest Neighbor Search

arXiv:2402.11354v29 citationsh-index: 21ICML
Originality Highly original
AI Analysis

This work addresses a key bottleneck in high-dimensional search for machine learning applications, offering a significant efficiency improvement over existing methods.

The paper tackled the problem of enhancing routing in graph-based approximate nearest neighbor search by introducing a method with probabilistic guarantees, resulting in a throughput increase of 1.6 to 2.5 times on common graph indexes and outperforming leading-edge routing by 1.1 to 1.4 times.

Approximate nearest neighbor search (ANNS) in high-dimensional spaces is a pivotal challenge in the field of machine learning. In recent years, graph-based methods have emerged as the superior approach to ANNS, establishing a new state of the art. Although various optimizations for graph-based ANNS have been introduced, they predominantly rely on heuristic methods that lack formal theoretical backing. This paper aims to enhance routing within graph-based ANNS by introducing a method that offers a probabilistic guarantee when exploring a node's neighbors in the graph. We formulate the problem as probabilistic routing and develop two baseline strategies by incorporating locality-sensitive techniques. Subsequently, we introduce PEOs, a novel approach that efficiently identifies which neighbors in the graph should be considered for exact distance calculation, thus significantly improving efficiency in practice. Our experiments demonstrate that equipping PEOs can increase throughput on commonly utilized graph indexes (HNSW and NSSG) by a factor of 1.6 to 2.5, and its efficiency consistently outperforms the leading-edge routing technique by 1.1 to 1.4 times.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes