CLJul 21, 2024

A Survey on Employing Large Language Models for Text-to-SQL Tasks

arXiv:2407.15186v5119 citationsh-index: 4
Originality Synthesis-oriented
AI Analysis

It synthesizes existing research for practitioners in natural language processing and database querying, but is incremental as a survey paper.

This survey comprehensively reviews LLM-based Text-to-SQL methods, categorizing prompt engineering and finetuning approaches while analyzing their performance on standard benchmarks.

With the development of the Large Language Models (LLMs), a large range of LLM-based Text-to-SQL(Text2SQL) methods have emerged. This survey provides a comprehensive review of LLM-based Text2SQL studies. We first enumerate classic benchmarks and evaluation metrics. For the two mainstream methods, prompt engineering and finetuning, we introduce a comprehensive taxonomy and offer practical insights into each subcategory. We present an overall analysis of the above methods and various models evaluated on well-known datasets and extract some characteristics. Finally, we discuss the challenges and future directions in this field.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes