DBAICLFeb 19, 2024

Structure Guided Large Language Model for SQL Generation

arXiv:2402.13284v48 citationsh-index: 13ICML
Originality Incremental advance
AI Analysis

This addresses the challenge of enabling non-expert users to interact with databases via natural language, though it appears incremental as it builds on decomposition-based methods.

The paper tackles the problem of LLMs struggling with complex database structures and user intentions in SQL generation by proposing a structure-guided framework that uses syntax-based prompting, achieving consistent outperformance over state-of-the-art models on benchmark datasets.

Recent advancements in large language models (LLMs) have shown promise in bridging the gap between natural language queries and database management systems, enabling users to interact with databases without the background of SQL. However, LLMs often struggle to comprehend complex database structures and accurately interpret user intentions. Decomposition-based methods have been proposed to enhance the performance of LLMs on complex tasks, but decomposing SQL generation into subtasks is non-trivial due to the declarative structure of SQL syntax and the intricate connections between query concepts and database elements. In this paper, we propose a novel Structure GUided text-to-SQL framework~(SGU-SQL) that incorporates syntax-based prompting to enhance the SQL generation capabilities of LLMs. Specifically, SGU-SQL establishes structure-aware links between user queries and database schema and decomposes the complex generation task using syntax-based prompting to enable more accurate LLM-based SQL generation. Extensive experiments on two benchmark datasets demonstrate that SGU-SQL consistently outperforms state-of-the-art text-to-SQL models.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes