CLJul 19, 2017

Sentence-level quality estimation by predicting HTER as a multi-component metric

arXiv:1707.06167v11089 citations
Originality Synthesis-oriented
AI Analysis

This is an incremental improvement for machine translation quality estimation, focusing on sentence-level HTER prediction.

The paper tackled sentence-level quality estimation by predicting HTER as a multi-component metric, using a model that jointly predicts four post-editing operations to calculate HTER, resulting in small but significant improvements over the baseline.

This submission investigates alternative machine learning models for predicting the HTER score on the sentence level. Instead of directly predicting the HTER score, we suggest a model that jointly predicts the amount of the 4 distinct post-editing operations, which are then used to calculate the HTER score. This also gives the possibility to correct invalid (e.g. negative) predicted values prior to the calculation of the HTER score. Without any feature exploration, a multi-layer perceptron with 4 outputs yields small but significant improvements over the baseline.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes