CVAug 9, 2017

Random Binary Trees for Approximate Nearest Neighbour Search in Binary Space

arXiv:1708.02976v28 citations
Originality Incremental advance
AI Analysis

This addresses the problem of efficient and accurate nearest neighbor search in binary spaces for applications like data mining and computer vision, representing an incremental improvement over existing methods.

The paper tackles approximate nearest neighbor search for high-dimensional binary vectors by proposing Random Binary Search Trees (RBST), achieving over 20% higher retrieval precision compared to state-of-the-art Locality Sensitive Hashing variants on a dataset of 1.25M binary descriptors from Google's Project Tango.

Approximate nearest neighbour (ANN) search is one of the most important problems in computer science fields such as data mining or computer vision. In this paper, we focus on ANN for high-dimensional binary vectors and we propose a simple yet powerful search method that uses Random Binary Search Trees (RBST). We apply our method to a dataset of 1.25M binary local feature descriptors obtained from a real-life image-based localisation system provided by Google as a part of Project Tango. An extensive evaluation of our method against the state-of-the-art variations of Locality Sensitive Hashing (LSH), namely Uniform LSH and Multi-probe LSH, shows the superiority of our method in terms of retrieval precision with performance boost of over 20%

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes