CLLGMLApr 11, 2017

Persian Wordnet Construction using Supervised Learning

arXiv:1704.03223v19 citations
Originality Synthesis-oriented
AI Analysis

This addresses the problem of building a comprehensive Persian wordnet for natural language processing applications, but it is incremental as it applies an existing supervised method to a new language-specific dataset.

The paper tackles automated Persian wordnet construction by using a supervised classification system with seven features to discriminate correct links between Persian words and Princeton WordNet synsets, achieving state-of-the-art results with a precision of 91.18% and over 16,000 words and 22,000 synsets.

This paper presents an automated supervised method for Persian wordnet construction. Using a Persian corpus and a bi-lingual dictionary, the initial links between Persian words and Princeton WordNet synsets have been generated. These links will be discriminated later as correct or incorrect by employing seven features in a trained classification system. The whole method is just a classification system, which has been trained on a train set containing FarsNet as a set of correct instances. State of the art results on the automatically derived Persian wordnet is achieved. The resulted wordnet with a precision of 91.18% includes more than 16,000 words and 22,000 synsets.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes