LGCLMLOct 28, 2018

Semi-Supervised Translation with MMD Networks

arXiv:1810.11906v11 citations
Originality Incremental advance
AI Analysis

It addresses the challenge of language translation with limited labeled data, but appears incremental as it builds on existing semi-supervised and MMD techniques.

This work tackles the problem of semi-supervised learning in neural networks for language translation by introducing a hybrid cost function with a Maximum Mean Discrepancy (MMD) estimator, achieving improved performance with methods like supervised pre-initialization and multi-scale kernels.

This work aims to improve semi-supervised learning in a neural network architecture by introducing a hybrid supervised and unsupervised cost function. The unsupervised component is trained using a differentiable estimator of the Maximum Mean Discrepancy (MMD) distance between the network output and the target dataset. We introduce the notion of an $n$-channel network and several methods to improve performance of these nets based on supervised pre-initialization, and multi-scale kernels. This work investigates the effectiveness of these methods on language translation where very few quality translations are known \textit{a priori}. We also present a thorough investigation of the hyper-parameter space of this method on both synthetic data.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes