CL, ML, Sep 30, 2015

Polish to English Statistical Machine Translation

arXiv:1510.00001v1, 2 citations
Originality: Synthesis-oriented
AI Analysis

This work addresses translation challenges for Polish-English spoken language, but it is incremental as it applies existing methods to new data without major innovations.

The study investigated how different training configurations affect a Polish to English statistical machine translation system for spoken language, using multiple corpora and evaluation metrics to assess translation quality.

This research explores the effects of various training settings on a Polish to English Statistical Machine Translation system for spoken language. Various elements of the TED, Europarl, and OPUS parallel text corpora were used to train the language models and to develop, tune, and test the translation system. The BLEU, NIST, METEOR and TER metrics were used to evaluate the effects of the data preparation on the translation results.
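To make the first of these metrics concrete, the following is a minimal sketch of a simplified sentence-level BLEU score: clipped n-gram precision for n = 1 to 4, combined by a geometric mean and multiplied by a brevity penalty. It is an illustration only, not the evaluation script used in the paper. The function names (`ngrams`, `sentence_bleu`), the whitespace tokenization, the single reference, and the absence of smoothing are all assumptions made for brevity; published results are normally computed at corpus level with standard tooling.

```python
import math
from collections import Counter


def ngrams(tokens, n):
    """Return a Counter of all n-grams (as tuples) in a token list."""
    return Counter(tuple(tokens[i:i + n]) for i in range(len(tokens) - n + 1))


def sentence_bleu(hypothesis, reference, max_n=4):
    """Simplified single-reference, sentence-level BLEU in [0, 1].

    Assumptions (not from the paper): whitespace tokenization,
    exactly one reference translation, and no smoothing.
    """
    hyp = hypothesis.split()
    ref = reference.split()
    if not hyp:
        return 0.0

    log_precision_sum = 0.0
    for n in range(1, max_n + 1):
        hyp_counts = ngrams(hyp, n)
        ref_counts = ngrams(ref, n)
        # Clip each hypothesis n-gram count by its count in the reference.
        matched = sum(min(count, ref_counts[gram])
                      for gram, count in hyp_counts.items())
        total = sum(hyp_counts.values())
        if matched == 0 or total == 0:
            # Without smoothing, a single zero precision makes BLEU zero.
            return 0.0
        log_precision_sum += math.log(matched / total)

    # The brevity penalty lowers the score of hypotheses shorter than the reference.
    if len(hyp) > len(ref):
        brevity_penalty = 1.0
    else:
        brevity_penalty = math.exp(1 - len(ref) / len(hyp))

    # Geometric mean of the n-gram precisions, scaled by the brevity penalty.
    return brevity_penalty * math.exp(log_precision_sum / max_n)
```

A hypothesis identical to its reference scores 1.0, and one that shares no words with the reference scores 0.0. Because this sketch has no smoothing, any short sentence lacking a 4-gram match also scores 0.0, which is one reason corpus-level aggregation is preferred in practice.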
