CLFeb 27, 2017

A case study on English-Malayalam Machine Translation

arXiv:1702.08217v18 citations
Originality Synthesis-oriented
AI Analysis

This is an incremental study providing specific insights for improving translation in a low-resource language pair, English-Malayalam.

The paper tackled machine translation between English and Malayalam by comparing Statistical Machine Translation (SMT) and Rule-Based Machine Translation (RBMT) systems, finding that SMT outperforms RBMT and that translation direction affects performance, with SMT better for English-Malayalam and RBMT better for Malayalam-English.

In this paper we present our work on a case study on Statistical Machine Translation (SMT) and Rule based machine translation (RBMT) for translation from English to Malayalam and Malayalam to English. One of the motivations of our study is to make a three way performance comparison, such as, a) SMT and RBMT b) English to Malayalam SMT and Malayalam to English SMT c) English to Malayalam RBMT and Malayalam to English RBMT. We describe the development of English to Malayalam and Malayalam to English baseline phrase based SMT system and the evaluation of its performance compared against the RBMT system. Based on our study the observations are: a) SMT systems outperform RBMT systems, b) In the case of SMT, English - Malayalam systems perform better than that of Malayalam - English systems, c) In the case RBMT, Malayalam to English systems are performing better than English to Malayalam systems. Based on our evaluations and detailed error analysis, we describe the requirements of incorporating morphological processing into the SMT to improve the accuracy of translation.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes