CLAIMay 3, 2023

End-to-end Training and Decoding for Pivot-based Cascaded Translation Model

arXiv:2305.02261v1
Originality Incremental advance
AI Analysis

This work addresses machine translation for low-resource language pairs, though it appears incremental as it builds on existing pivot-based approaches.

The paper tackles the problem of low-resource machine translation by proposing an end-to-end training method and improved decoding algorithm for pivot-based cascaded models, which enhances translation quality as demonstrated in experiments.

Utilizing pivot language effectively can significantly improve low-resource machine translation. Usually, the two translation models, source-pivot and pivot-target, are trained individually and do not utilize the limited (source, target) parallel data. This work proposes an end-to-end training method for the cascaded translation model and configures an improved decoding algorithm. The input of the pivot-target model is modified to weighted pivot embedding based on the probability distribution output by the source-pivot model. This allows the model to be trained end-to-end. In addition, we mitigate the inconsistency between tokens and probability distributions while using beam search in pivot decoding. Experiments demonstrate that our method enhances the quality of translation.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes