CLMay 2, 2017

A Teacher-Student Framework for Zero-Resource Neural Machine Translation

arXiv:1705.00753v1155 citations
Originality Highly original
AI Analysis

This addresses the problem of zero-resource translation for low-resource language pairs, offering a novel method but with incremental gains.

The paper tackles the data scarcity problem in neural machine translation for low-resource language pairs by proposing a teacher-student framework that trains a source-to-target model without parallel corpora, achieving a +3.0 BLEU improvement over a baseline pivot-based model.

While end-to-end neural machine translation (NMT) has made remarkable progress recently, it still suffers from the data scarcity problem for low-resource language pairs and domains. In this paper, we propose a method for zero-resource NMT by assuming that parallel sentences have close probabilities of generating a sentence in a third language. Based on this assumption, our method is able to train a source-to-target NMT model ("student") without parallel corpora available, guided by an existing pivot-to-target NMT model ("teacher") on a source-pivot parallel corpus. Experimental results show that the proposed method significantly improves over a baseline pivot-based model by +3.0 BLEU points across various language pairs.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes