CLNov 26, 2019

Semi-supervised Bootstrapping of Dialogue State Trackers for Task Oriented Modelling

Bo-Hsiang Tseng, Marek Rei, Paweł Budzianowski, Richard E. Turner, Bill Byrne, Anna Korhonen

arXiv:1911.11672v10.2

Originality Incremental advance

AI Analysis

This addresses the annotation bottleneck for dialogue system developers, though it appears incremental as it builds on existing semi-supervised methods.

The paper tackles the problem of expensive annotation for dialogue systems by investigating semi-supervised learning to reduce required intermediate labeling, finding that annotations can be reduced by up to 30% while maintaining equivalent performance on the MultiWOZ corpus.

Dialogue systems benefit greatly from optimizing on detailed annotations, such as transcribed utterances, internal dialogue state representations and dialogue act labels. However, collecting these annotations is expensive and time-consuming, holding back development in the area of dialogue modelling. In this paper, we investigate semi-supervised learning methods that are able to reduce the amount of required intermediate labelling. We find that by leveraging un-annotated data instead, the amount of turn-level annotations of dialogue state can be significantly reduced when building a neural dialogue system. Our analysis on the MultiWOZ corpus, covering a range of domains and topics, finds that annotations can be reduced by up to 30\% while maintaining equivalent system performance. We also describe and evaluate the first end-to-end dialogue model created for the MultiWOZ corpus.

View on arXiv PDF

Similar