CVLGMLJul 30, 2013

Scalable $k$-NN graph construction

arXiv:1307.7852v12 citations
Originality Incremental advance
AI Analysis

This addresses a bottleneck in data-driven techniques for learning and vision tasks, offering an incremental improvement in scalability.

The paper tackles the challenge of constructing k-NN graphs efficiently and accurately for large-scale high-dimensional data, proposing a hierarchical random division approach with neighborhood propagation that achieves significant speed-up on visual data.

The $k$-NN graph has played a central role in increasingly popular data-driven techniques for various learning and vision tasks; yet, finding an efficient and effective way to construct $k$-NN graphs remains a challenge, especially for large-scale high-dimensional data. In this paper, we propose a new approach to construct approximate $k$-NN graphs with emphasis in: efficiency and accuracy. We hierarchically and randomly divide the data points into subsets and build an exact neighborhood graph over each subset, achieving a base approximate neighborhood graph; we then repeat this process for several times to generate multiple neighborhood graphs, which are combined to yield a more accurate approximate neighborhood graph. Furthermore, we propose a neighborhood propagation scheme to further enhance the accuracy. We show both theoretical and empirical accuracy and efficiency of our approach to $k$-NN graph construction and demonstrate significant speed-up in dealing with large scale visual data.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes