CLDec 22, 2014

Pragmatic Neural Language Modelling in Machine Translation

arXiv:1412.7119v334 citations
Originality Synthesis-oriented
AI Analysis

This work addresses the challenge of scaling neural language models for practical machine translation applications, though it is incremental in nature.

This paper investigates how to effectively integrate neural language models into machine translation systems, finding that while neural models are suitable for memory-constrained environments, they still underperform traditional models in raw translation quality.

This paper presents an in-depth investigation on integrating neural language models in translation systems. Scaling neural language models is a difficult task, but crucial for real-world applications. This paper evaluates the impact on end-to-end MT quality of both new and existing scaling techniques. We show when explicitly normalising neural models is necessary and what optimisation tricks one should use in such scenarios. We also focus on scalable training algorithms and investigate noise contrastive estimation and diagonal contexts as sources for further speed improvements. We explore the trade-offs between neural models and back-off n-gram models and find that neural models make strong candidates for natural language applications in memory constrained environments, yet still lag behind traditional models in raw translation quality. We conclude with a set of recommendations one should follow to build a scalable neural language model for MT.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes