CL ML Sep 30, 2015

Polish to English Statistical Machine Translation

arXiv:1510.00001v12.22 citations

Originality: Synthesis-oriented

AI Analysis

This work addresses the challenges of translating spoken Polish into English, but it is incremental: it applies existing methods to new data without major innovations.

The study investigated how different training configurations affect a Polish to English statistical machine translation system for spoken language, using multiple corpora and evaluation metrics to assess translation quality.

This research explores the effects of various training settings on a Polish to English Statistical Machine Translation system for spoken language. Various elements of the TED, Europarl, and OPUS parallel text corpora were used as the basis for training language models and for development, tuning, and testing of the translation system. The BLEU, NIST, METEOR, and TER metrics were used to evaluate the effects of the data preparations on the translation results.
