CLFeb 21, 2019

Development of a classifiers/quantifiers dictionary towards French-Japanese MT

arXiv:1902.08061v1616 citations
AI Analysis

This addresses translation challenges for distant language pairs like French-Japanese, but it is incremental as it builds on existing UNL-UW methods for dictionary creation.

The paper tackles the problem of classifiers/quantifiers (CQs) causing lexical ambiguity and recognition issues in French-Japanese machine translation by developing a CQs dictionary from annotated corpus using UNL-UWs, resulting in a bilingual/multilingual dictionary to improve MT accuracy.

Although classifiers/quantifiers (CQs) expressions appear frequently in everyday communications or written documents, they are described neither in classical bilingual paper dictionaries , nor in machine-readable dictionaries. The paper describes a CQs dictionary, edited from the corpus we have annotated, and its usage in the framework of French-Japanese machine translation (MT). CQs treatment in MT often causes problems of lexical ambiguity, polylexical phrase recognition difficulties in analysis and doubtful output in transfer-generation, in particular for distant languages pairs like French and Japanese. Our basic treatment of CQs is to annotate the corpus by UNL-UWs (Universal Networking Language-Universal words) 1 , and then to produce a bilingual or multilingual dictionary of CQs, based on synonymy through identity of UWs.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes