CLAIOct 21, 2020

TurnGPT: a Transformer-based Language Model for Predicting Turn-taking in Spoken Dialog

arXiv:2010.10874v11003 citations
Originality Incremental advance
AI Analysis

This work addresses turn-taking prediction for spoken dialog systems, but it is incremental as it builds on existing transformer methods with specific adaptations.

The authors tackled the problem of predicting turn-taking in spoken dialog by introducing TurnGPT, a transformer-based language model, which outperformed prior baselines on various datasets.

Syntactic and pragmatic completeness is known to be important for turn-taking prediction, but so far machine learning models of turn-taking have used such linguistic information in a limited way. In this paper, we introduce TurnGPT, a transformer-based language model for predicting turn-shifts in spoken dialog. The model has been trained and evaluated on a variety of written and spoken dialog datasets. We show that the model outperforms two baselines used in prior work. We also report on an ablation study, as well as attention and gradient analyses, which show that the model is able to utilize the dialog context and pragmatic completeness for turn-taking prediction. Finally, we explore the model's potential in not only detecting, but also projecting, turn-completions.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes