CLAug 10, 2015

Feature-based Decipherment for Large Vocabulary Machine Translation

arXiv:1508.02142v13 citations
Originality Incremental advance
AI Analysis

This work addresses machine translation for closely related languages, but it is incremental as it builds on existing decipherment models with feature-based improvements.

The authors tackled the problem of machine translation for closely related languages by developing a log-linear model with orthographic similarity features, which outperformed existing generative decipherment models and scaled to large vocabularies.

Orthographic similarities across languages provide a strong signal for probabilistic decipherment, especially for closely related language pairs. The existing decipherment models, however, are not well-suited for exploiting these orthographic similarities. We propose a log-linear model with latent variables that incorporates orthographic similarity features. Maximum likelihood training is computationally expensive for the proposed log-linear model. To address this challenge, we perform approximate inference via MCMC sampling and contrastive divergence. Our results show that the proposed log-linear model with contrastive divergence scales to large vocabularies and outperforms the existing generative decipherment models by exploiting the orthographic features.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes