CLNov 1, 2020

Improving Conversational Question Answering Systems after Deployment using Feedback-Weighted Learning

Jon Ander Campos, Kyunghyun Cho, Arantxa Otegi, Aitor Soroa, Gorka Azkune, Eneko Agirre

arXiv:2011.00615v131.0992 citationsHas Code

Originality Incremental advance

AI Analysis

This work addresses the challenge of enhancing conversational AI systems post-deployment for developers and users, though it is incremental as it builds on existing feedback methods.

The paper tackles the problem of improving conversational question answering systems after deployment using only binary user feedback, achieving results close to fully-supervised systems in in-domain experiments and matching them in out-of-domain scenarios.

The interaction of conversational systems with users poses an exciting opportunity for improving them after deployment, but little evidence has been provided of its feasibility. In most applications, users are not able to provide the correct answer to the system, but they are able to provide binary (correct, incorrect) feedback. In this paper we propose feedback-weighted learning based on importance sampling to improve upon an initial supervised system using binary user feedback. We perform simulated experiments on document classification (for development) and Conversational Question Answering datasets like QuAC and DoQA, where binary user feedback is derived from gold annotations. The results show that our method is able to improve over the initial supervised system, getting close to a fully-supervised system that has access to the same labeled examples in in-domain experiments (QuAC), and even matching in out-of-domain experiments (DoQA). Our work opens the prospect to exploit interactions with real users and improve conversational systems after deployment.

View on arXiv PDF Code

Similar