Polish to English Statistical Machine Translation
This work addresses translation challenges for Polish-English spoken language, but it is incremental as it applies existing methods to new data without major innovations.
The study investigated how different training configurations affect a Polish to English statistical machine translation system for spoken language, using multiple corpora and evaluation metrics to assess translation quality.
This research explores the effects of various training settings on a Polish to English Statistical Machine Translation system for spoken language. Various elements of the TED, Europarl, and OPUS parallel text corpora were used as the basis for training of language models, for development, tuning and testing of the translation system. The BLEU, NIST, METEOR and TER metrics were used to evaluate the effects of the data preparations on the translation results.