CLLGAug 6, 2024

Conditioning LLMs with Emotion in Neural Machine Translation

arXiv:2408.03150v127 citationsh-index: 14
Originality Incremental advance
AI Analysis

This work addresses enhancing translation quality for users by incorporating emotion, but it is incremental as it builds on existing LLMs and SER models.

The authors tackled improving machine translation by integrating emotion information from speech into large language models, resulting in notable improvements in translation quality, especially with arousal.

Large Language Models (LLMs) have shown remarkable performance in Natural Language Processing tasks, including Machine Translation (MT). In this work, we propose a novel MT pipeline that integrates emotion information extracted from a Speech Emotion Recognition (SER) model into LLMs to enhance translation quality. We first fine-tune five existing LLMs on the Libri-trans dataset and select the most performant model. Subsequently, we augment LLM prompts with different dimensional emotions and train the selected LLM under these different configurations. Our experiments reveal that integrating emotion information, especially arousal, into LLM prompts leads to notable improvements in translation quality.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes