Joint Reasoning on Hybrid-knowledge sources for Task-Oriented Dialog
This addresses the challenge of integrating hybrid knowledge sources for more flexible and robust task-oriented dialog systems, though it is incremental as it builds on existing pre-trained language models.
The paper tackles the problem of task-oriented dialog systems that rely on strict assumptions about knowledge sources, showing that current methods degrade significantly when these assumptions are removed. It proposes a BART-based model that fuses structured and unstructured knowledge without such assumptions, demonstrating robustness to knowledge modality perturbations.
Traditional systems designed for task oriented dialog utilize knowledge present only in structured knowledge sources to generate responses. However, relevant information required to generate responses may also reside in unstructured sources, such as documents. Recent state of the art models such as HyKnow and SeKnow aimed at overcoming these challenges make limiting assumptions about the knowledge sources. For instance, these systems assume that certain types of information, such as a phone number, is always present in a structured knowledge base (KB) while information about aspects such as entrance ticket prices, would always be available in documents. In this paper, we create a modified version of the MutliWOZ-based dataset prepared by SeKnow to demonstrate how current methods have significant degradation in performance when strict assumptions about the source of information are removed. Then, in line with recent work exploiting pre-trained language models, we fine-tune a BART based model using prompts for the tasks of querying knowledge sources, as well as, for response generation, without making assumptions about the information present in each knowledge source. Through a series of experiments, we demonstrate that our model is robust to perturbations to knowledge modality (source of information), and that it can fuse information from structured as well as unstructured knowledge to generate responses.