DBCLMar 26, 2022

Demonstrating CAT: Synthesizing Data-Aware Conversational Agents for Transactional Databases

arXiv:2203.14144v11 citationsh-index: 31Has Code
Originality Incremental advance
AI Analysis

This addresses the problem of reducing data and expertise requirements for developers building chatbots for applications like booking systems, though it is incremental as it builds on existing conversational agent methods.

The paper tackles the challenge of creating conversational agents for transactional databases by introducing CAT, which synthesizes training data using weak supervision and integrates the agent with the database, resulting in more efficient dialogues due to data-aware decision-making.

Databases for OLTP are often the backbone for applications such as hotel room or cinema ticket booking applications. However, developing a conversational agent (i.e., a chatbot-like interface) to allow end-users to interact with an application using natural language requires both immense amounts of training data and NLP expertise. This motivates CAT, which can be used to easily create conversational agents for transactional databases. The main idea is that, for a given OLTP database, CAT uses weak supervision to synthesize the required training data to train a state-of-the-art conversational agent, allowing users to interact with the OLTP database. Furthermore, CAT provides an out-of-the-box integration of the resulting agent with the database. As a major difference to existing conversational agents, agents synthesized by CAT are data-aware. This means that the agent decides which information should be requested from the user based on the current data distributions in the database, which typically results in markedly more efficient dialogues compared with non-data-aware agents. We publish the code for CAT as open source.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes