CLSep 2, 2021

Towards Making the Most of Dialogue Characteristics for Neural Chat Translation

arXiv:2109.00668v1666 citations
Originality Incremental advance
AI Analysis

This work addresses the challenge of translating conversational text more accurately for users of multilingual chat systems, representing an incremental improvement over existing methods.

The paper tackles the problem of neural chat translation by modeling dialogue characteristics like coherence and speaker personality, achieving improved translation quality as verified by experiments on English-German and English-Chinese language pairs.

Neural Chat Translation (NCT) aims to translate conversational text between speakers of different languages. Despite the promising performance of sentence-level and context-aware neural machine translation models, there still remain limitations in current NCT models because the inherent dialogue characteristics of chat, such as dialogue coherence and speaker personality, are neglected. In this paper, we propose to promote the chat translation by introducing the modeling of dialogue characteristics into the NCT model. To this end, we design four auxiliary tasks including monolingual response generation, cross-lingual response generation, next utterance discrimination, and speaker identification. Together with the main chat translation task, we optimize the NCT model through the training objectives of all these tasks. By this means, the NCT model can be enhanced by capturing the inherent dialogue characteristics, thus generating more coherent and speaker-relevant translations. Comprehensive experiments on four language directions (English-German and English-Chinese) verify the effectiveness and superiority of the proposed approach.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes