CLJun 4, 2019

Lattice-Based Transformer Encoder for Neural Machine Translation

arXiv:1906.01282v11116 citations
Originality Incremental advance
AI Analysis

This work addresses a domain-specific issue in neural machine translation by integrating multiple segmentations, but it is incremental as it builds on existing Transformer methods.

The paper tackled the problem of segmentation diversity affecting neural machine translation performance by proposing lattice-based encoders for the Transformer model, resulting in improved translation performance over conventional encoders.

Neural machine translation (NMT) takes deterministic sequences for source representations. However, either word-level or subword-level segmentations have multiple choices to split a source sequence with different word segmentors or different subword vocabulary sizes. We hypothesize that the diversity in segmentations may affect the NMT performance. To integrate different segmentations with the state-of-the-art NMT model, Transformer, we propose lattice-based encoders to explore effective word or subword representation in an automatic way during training. We propose two methods: 1) lattice positional encoding and 2) lattice-aware self-attention. These two methods can be used together and show complementary to each other to further improve translation performance. Experiment results show superiorities of lattice-based encoders in word-level and subword-level representations over conventional Transformer encoder.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes