CL | Sep 22, 2025

Specification-Aware Machine Translation and Evaluation for Purpose Alignment

arXiv:2509.17559v1 | 2 citations | h-index: 13 | Proceedings of the Tenth Conference on Machine Translation
Originality: Synthesis-oriented
AI Analysis

The paper addresses the gap between perceived and expected translation quality for professional translators and clients, though its contribution is incremental: it applies existing methods to a new domain.

The paper tackles machine translation in professional settings by integrating client specifications into the workflow. It finds that specification-guided LLM translations consistently outperformed official human translations in expert evaluations.

In professional settings, translation is guided by communicative goals and client needs, often formalized as specifications. While existing evaluation frameworks acknowledge the importance of such specifications, these specifications are often treated only implicitly in machine translation (MT) research. Drawing on translation studies, we provide a theoretical rationale for why specifications matter in professional translation, as well as a practical guide to implementing specification-aware MT and evaluation. Building on this foundation, we apply our framework to the translation of investor relations texts from 33 publicly listed companies. In our experiment, we compare five translation types, including official human translations and prompt-based outputs from large language models (LLMs), using expert error analysis, user preference rankings, and an automatic metric. The results show that LLM translations guided by specifications consistently outperformed official human translations in human evaluations, highlighting a gap between perceived and expected quality. These findings demonstrate that integrating specifications into MT workflows, with human oversight, can improve translation quality in ways aligned with professional practice.
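
To make the idea of a specification-guided prompt concrete, the following minimal sketch shows one way client specifications might be placed ahead of the source text before it is sent to an LLM. It is not the authors' implementation: the `TranslationSpec` class, its field names, the `build_prompt` helper, and the example values (including the language pair) are all hypothetical, chosen only for illustration.

```python
from dataclasses import dataclass, field


@dataclass
class TranslationSpec:
    """Hypothetical container for client specifications (field names are assumptions)."""
    source_lang: str
    target_lang: str
    purpose: str
    audience: str
    glossary: dict[str, str] = field(default_factory=dict)


def build_prompt(spec: TranslationSpec, source_text: str) -> str:
    """Assemble a plain-text prompt that states the specifications before the source text."""
    lines = [
        f"Translate the following text from {spec.source_lang} to {spec.target_lang}.",
        f"Purpose: {spec.purpose}",
        f"Audience: {spec.audience}",
    ]
    if spec.glossary:
        # Preferred term translations are listed one per line.
        lines.append("Use these term translations:")
        lines.extend(f"- {src} -> {tgt}" for src, tgt in spec.glossary.items())
    lines.extend(["", "Source text:", source_text])
    return "\n".join(lines)


# Illustrative values only; the abstract does not state the language pair or any glossary.
spec = TranslationSpec(
    source_lang="German",
    target_lang="English",
    purpose="Inform investors about quarterly results",
    audience="Institutional and retail investors",
    glossary={"Umsatz": "revenue"},
)
prompt = build_prompt(spec, "Example sentence.")
print(prompt)
```

Keeping the specification in a structured object rather than in free text means the same fields could also be shown to evaluators, so that translation and evaluation refer to the same stated purpose. That pairing is a design assumption of this sketch, not a claim about the paper's setup.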

