DBAIDec 1, 2024

CONCERTO: Complex Query Execution Mechanism-Aware Learned Cost Estimation

arXiv:2412.00749v2h-index: 4
Originality Incremental advance
AI Analysis

This addresses a bottleneck in database management systems for data analysts, but it is incremental as it builds on existing learned cost estimation approaches.

The paper tackles the problem of predicting query performance in DBMSs with complex execution mechanisms like vectorized operators and parallel execution, proposing CONCERTO, which achieves higher prediction accuracy than existing methods.

With the growing demand for massive data analysis, many DBMSs have adopted complex underlying query execution mechanisms, including vectorized operators, parallel execution, and dynamic pipeline modifications. However, there remains a lack of targeted Query Performance Prediction (QPP) methods for these complex execution mechanisms and their interactions, as most existing approaches focus on traditional tree-shaped query plans and static serial executors. To address this challenge, this paper proposes CONCERTO, a Complex query executiON meChanism-awaE leaRned cosT estimatiOn method. CONCERTO first establishes independent resource cost models for each physical operator. It then constructs a Directed Acyclic Graph (DAG) consisting of a dataflow tree backbone and resource competition relationships among concurrent operators. After calibrating the cost impact of parallel operator execution using Graph Attention Networks (GATs) with additional attention mechanisms, CONCERTO extracts and aggregates cost vector trees through Temporal Convolutional Networks (TCNs), ultimately achieving effective query performance prediction. Experimental results demonstrate that CONCERTO achieves higher prediction accuracy than existing methods.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes