CLJul 21, 2024

A Survey on Employing Large Language Models for Text-to-SQL Tasks

Liang Shi, Zhengju Tang, Nan Zhang, Xiaotong Zhang, Zhi Yang

arXiv:2407.15186v521.5130 citationsh-index: 4

Originality Synthesis-oriented

AI Analysis

It synthesizes existing research for practitioners in natural language processing and database querying, but is incremental as a survey paper.

This survey comprehensively reviews LLM-based Text-to-SQL methods, categorizing prompt engineering and finetuning approaches while analyzing their performance on standard benchmarks.

With the development of the Large Language Models (LLMs), a large range of LLM-based Text-to-SQL(Text2SQL) methods have emerged. This survey provides a comprehensive review of LLM-based Text2SQL studies. We first enumerate classic benchmarks and evaluation metrics. For the two mainstream methods, prompt engineering and finetuning, we introduce a comprehensive taxonomy and offer practical insights into each subcategory. We present an overall analysis of the above methods and various models evaluated on well-known datasets and extract some characteristics. Finally, we discuss the challenges and future directions in this field.

View on arXiv PDF

Similar