CLJun 4, 2019

Lattice-Based Transformer Encoder for Neural Machine Translation

Fengshun Xiao, Jiangtong Li, Hai Zhao, Rui Wang, Kehai Chen

arXiv:1906.01282v131.21116 citations

Originality Incremental advance

AI Analysis

This work addresses a domain-specific issue in neural machine translation by integrating multiple segmentations, but it is incremental as it builds on existing Transformer methods.

The paper tackled the problem of segmentation diversity affecting neural machine translation performance by proposing lattice-based encoders for the Transformer model, resulting in improved translation performance over conventional encoders.

Neural machine translation (NMT) takes deterministic sequences for source representations. However, either word-level or subword-level segmentations have multiple choices to split a source sequence with different word segmentors or different subword vocabulary sizes. We hypothesize that the diversity in segmentations may affect the NMT performance. To integrate different segmentations with the state-of-the-art NMT model, Transformer, we propose lattice-based encoders to explore effective word or subword representation in an automatic way during training. We propose two methods: 1) lattice positional encoding and 2) lattice-aware self-attention. These two methods can be used together and show complementary to each other to further improve translation performance. Experiment results show superiorities of lattice-based encoders in word-level and subword-level representations over conventional Transformer encoder.

View on arXiv PDF

Similar