CLApr 30, 2018

Accelerating NMT Batched Beam Decoding with LMBR Posteriors for Deployment

arXiv:1804.11324v11091 citations
Originality Synthesis-oriented
AI Analysis

This work addresses deployment efficiency for NMT systems, but it appears incremental as it builds on existing LMBR and Transformer methods.

The paper tackled the problem of accelerating batched beam decoding for neural machine translation (NMT) by integrating LMBR n-gram posteriors, showing that LMBR techniques yield gains on top of recent Transformer results, with concrete numbers implied but not specified in the abstract.

We describe a batched beam decoding algorithm for NMT with LMBR n-gram posteriors, showing that LMBR techniques still yield gains on top of the best recently reported results with Transformers. We also discuss acceleration strategies for deployment, and the effect of the beam size and batching on memory and speed.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes