CL | LG | ML | Dec 15, 2017

Sockeye: A Toolkit for Neural Machine Translation

arXiv:1712.05690v2 | 217 citations | Has Code
Originality: Synthesis-oriented
AI Analysis

This toolkit gives machine translation researchers and practitioners both a production-ready framework and an experimental platform. Its contribution is incremental, since it builds on existing architectures and methods.

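The authors introduce Sockeye, an open-source toolkit for neural machine translation that supports three encoder-decoder architectures: attentional recurrent neural networks, self-attentional transformers, and fully convolutional networks. They benchmark it against other NMT toolkits on the WMT 2017 English-German and Latvian-English tasks and report competitive BLEU scores across all three architectures, including an overall best score for the transformer implementation.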

We describe Sockeye (version 1.12), an open-source sequence-to-sequence toolkit for Neural Machine Translation (NMT). Sockeye is a production-ready framework for training and applying models as well as an experimental platform for researchers. Written in Python and built on MXNet, the toolkit offers scalable training and inference for the three most prominent encoder-decoder architectures: attentional recurrent neural networks, self-attentional transformers, and fully convolutional networks. Sockeye also supports a wide range of optimizers, normalization and regularization techniques, and inference improvements from current NMT literature. Users can easily run standard training recipes, explore different model settings, and incorporate new ideas. In this paper, we highlight Sockeye's features and benchmark it against other NMT toolkits on two language arcs from the 2017 Conference on Machine Translation (WMT): English-German and Latvian-English. We report competitive BLEU scores across all three architectures, including an overall best score for Sockeye's transformer implementation. To facilitate further comparison, we release all system outputs and training scripts used in our experiments. The Sockeye toolkit is free software released under the Apache 2.0 license.

Code Implementations: 16 repos

Data from Papers with Code (CC-BY-SA-4.0)

