DBLGPFApr 16, 2024

VDTuner: Automated Performance Tuning for Vector Data Management Systems

arXiv:2404.10413v113 citationsh-index: 17Has CodeICDE
Originality Incremental advance
AI Analysis

This addresses performance optimization for users of vector data management systems in large-scale information retrieval and machine learning, though it is incremental as it builds on existing auto-tuning methods with a novel approach for this specific bottleneck.

The paper tackles the problem of automatic performance tuning for vector data management systems, which is challenging due to complex parameter spaces, and introduces VDTuner, a learning-based framework using multi-objective Bayesian optimization that improves search speed by 14.12% and recall rate by 186.38% compared to default settings.

Vector data management systems (VDMSs) have become an indispensable cornerstone in large-scale information retrieval and machine learning systems like large language models. To enhance the efficiency and flexibility of similarity search, VDMS exposes many tunable index parameters and system parameters for users to specify. However, due to the inherent characteristics of VDMS, automatic performance tuning for VDMS faces several critical challenges, which cannot be well addressed by the existing auto-tuning methods. In this paper, we introduce VDTuner, a learning-based automatic performance tuning framework for VDMS, leveraging multi-objective Bayesian optimization. VDTuner overcomes the challenges associated with VDMS by efficiently exploring a complex multi-dimensional parameter space without requiring any prior knowledge. Moreover, it is able to achieve a good balance between search speed and recall rate, delivering an optimal configuration. Extensive evaluations demonstrate that VDTuner can markedly improve VDMS performance (14.12% in search speed and 186.38% in recall rate) compared with default setting, and is more efficient compared with state-of-the-art baselines (up to 3.57 times faster in terms of tuning time). In addition, VDTuner is scalable to specific user preference and cost-aware optimization objective. VDTuner is available online at https://github.com/tiannuo-yang/VDTuner.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes