CLOct 23, 2022

Towards Generalizable and Robust Text-to-SQL Parsing

arXiv:2210.12674v1294 citationsh-index: 29Has Code
Originality Incremental advance
AI Analysis

This work addresses the need for more reliable text-to-SQL systems in real-world applications, though it is incremental as it builds on existing methods to enhance specific capabilities.

The paper tackles the problem of improving the generalizability and robustness of text-to-SQL parsers across various challenging scenarios, achieving state-of-the-art performance on datasets like Spider, SParC, and CoSQL.

Text-to-SQL parsing tackles the problem of mapping natural language questions to executable SQL queries. In practice, text-to-SQL parsers often encounter various challenging scenarios, requiring them to be generalizable and robust. While most existing work addresses a particular generalization or robustness challenge, we aim to study it in a more comprehensive manner. In specific, we believe that text-to-SQL parsers should be (1) generalizable at three levels of generalization, namely i.i.d., zero-shot, and compositional, and (2) robust against input perturbations. To enhance these capabilities of the parser, we propose a novel TKK framework consisting of Task decomposition, Knowledge acquisition, and Knowledge composition to learn text-to-SQL parsing in stages. By dividing the learning process into multiple stages, our framework improves the parser's ability to acquire general SQL knowledge instead of capturing spurious patterns, making it more generalizable and robust. Experimental results under various generalization and robustness settings show that our framework is effective in all scenarios and achieves state-of-the-art performance on the Spider, SParC, and CoSQL datasets. Code can be found at https://github.com/AlibabaResearch/DAMO-ConvAI/tree/main/tkk.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes