CodexDB: Generating Code for Processing SQL Queries using GPT-3 Codex
This work addresses the challenge of customizing SQL query processing for users through natural language, though it is an incremental application of existing models to a specific domain.
The authors tackled the problem of processing SQL queries by developing CodexDB, a framework that uses GPT-3 Codex to translate natural language instructions into code, achieving correct code generation for a majority of queries in the WikiSQL benchmark.
CodexDB is an SQL processing engine whose internals can be customized via natural language instructions. CodexDB is based on OpenAI's GPT-3 Codex model which translates text into code. It is a framework on top of GPT-3 Codex that decomposes complex SQL queries into a series of simple processing steps, described in natural language. Processing steps are enriched with user-provided instructions and descriptions of database properties. Codex translates the resulting text into query processing code. An early prototype of CodexDB is able to generate correct code for a majority of queries of the WikiSQL benchmark and can be customized in various ways.