Generate, Evaluate, and Select: A Dialogue System with a Response Evaluator for Diversity-Aware Response Generation
This addresses the problem of unengaging conversational partners in dialogue systems, but it is incremental as it builds on existing generator-evaluator approaches.
The authors tackled the lack of diversity in dialogue system responses by proposing a generator-evaluator model that generates multiple responses and selects the best one, with human evaluations showing the proposed system's responses were often judged better than a baseline.
We aim to overcome the lack of diversity in responses of current dialogue systems and to develop a dialogue system that is engaging as a conversational partner. We propose a generator-evaluator model that evaluates multiple responses generated by a response generator and selects the best response by an evaluator. By generating multiple responses, we obtain diverse responses. We conduct human evaluations to compare the output of the proposed system with that of a baseline system. The results of the human evaluations showed that the proposed system's responses were often judged to be better than the baseline system's, and indicated the effectiveness of the proposed method.