LGAIDBOct 1, 2025

Panorama: Fast-Track Nearest Neighbors

arXiv:2510.00566v31 citationsh-index: 7
Originality Incremental advance
AI Analysis

This addresses a critical performance bottleneck in ANNS systems used in recommendation systems, image/video retrieval, NLP, and RAG, though it is an incremental improvement that integrates with existing methods.

The paper tackles the verification bottleneck in Approximate Nearest-Neighbor Search (ANNS) systems, where up to 99% of query time is spent computing distances in the refinement phase, and presents PANORAMA, a machine learning-driven approach using learned orthogonal transforms that achieves 2-30× end-to-end speedup with no recall loss across diverse datasets.

Approximate Nearest-Neighbor Search (ANNS) efficiently finds data items whose embeddings are close to that of a given query in a high-dimensional space, aiming to balance accuracy with speed. Used in recommendation systems, image and video retrieval, natural language processing, and retrieval-augmented generation (RAG), ANNS algorithms such as IVFPQ, HNSW graphs, Annoy, and MRPT utilize graph, tree, clustering, and quantization techniques to navigate large vector spaces. Despite this progress, ANNS systems spend up to 99% of query time to compute distances in their final refinement phase. In this paper, we present PANORAMA, a machine learning-driven approach that tackles the ANNS verification bottleneck through data-adaptive learned orthogonal transforms that facilitate the accretive refinement of distance bounds. Such transforms compact over 90% of signal energy into the first half of dimensions, enabling early candidate pruning with partial distance computations. We integrate PANORAMA into state-of-the-art ANNS methods, namely IVFPQ/Flat, HNSW, MRPT, and Annoy, without index modification, using level-major memory layouts, SIMD-vectorized partial distance computations, and cache-aware access patterns. Experiments across diverse datasets -- from image-based CIFAR-10 and GIST to modern embedding spaces including OpenAI's Ada 2 and Large 3 -- demonstrate that PANORAMA affords a 2--30$\times$ end-to-end speedup with no recall loss.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes