CLApr 18, 2021

Stream-level Latency Evaluation for Simultaneous Machine Translation

arXiv:2104.08817v2663 citations
AI Analysis

This work addresses latency evaluation for streaming applications in simultaneous machine translation, but it is incremental as it adapts existing measures rather than introducing a new paradigm.

The paper tackled the mismatch between sentence-level latency measures and real-time streaming in simultaneous machine translation by proposing a stream-level adaptation based on re-segmentation, which was successfully evaluated on an IWSLT task.

Simultaneous machine translation has recently gained traction thanks to significant quality improvements and the advent of streaming applications. Simultaneous translation systems need to find a trade-off between translation quality and response time, and with this purpose multiple latency measures have been proposed. However, latency evaluations for simultaneous translation are estimated at the sentence level, not taking into account the sequential nature of a streaming scenario. Indeed, these sentence-level latency measures are not well suited for continuous stream translation resulting in figures that are not coherent with the simultaneous translation policy of the system being assessed. This work proposes a stream-level adaptation of the current latency measures based on a re-segmentation approach applied to the output translation, that is successfully evaluated on streaming conditions for a reference IWSLT task.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes