SE AI CLMar 29, 2022

Quality Assurance of Generative Dialog Models in an Evolving Conversational Agent Used for Swedish Language Practice

Markus Borg, Johan Bengtsson, Harald Österling, Alexander Hagelborn, Isabella Gagner, Piotr Tomaszewski

arXiv:2203.15414v12.33 citationsh-index: 22

Originality Synthesis-oriented

AI Analysis

This work addresses the problem of ensuring model quality for language learners in migration contexts, but it is incremental as it provides initial steps toward an automated framework.

The researchers tackled quality assurance for generative dialog models used in a Swedish language practice conversational agent by designing automated test cases for 15 out of 38 requirements, finding that six test cases could detect meaningful differences between candidate models.

Due to the migration megatrend, efficient and effective second-language acquisition is vital. One proposed solution involves AI-enabled conversational agents for person-centered interactive language practice. We present results from ongoing action research targeting quality assurance of proprietary generative dialog models trained for virtual job interviews. The action team elicited a set of 38 requirements for which we designed corresponding automated test cases for 15 of particular interest to the evolving solution. Our results show that six of the test case designs can detect meaningful differences between candidate models. While quality assurance of natural language processing applications is complex, we provide initial steps toward an automated framework for machine learning model selection in the context of an evolving conversational agent. Future work will focus on model selection in an MLOps setting.

View on arXiv PDF

Similar