CLJul 4, 2018

Towards Automation of Sense-type Identification of Verbs in OntoSenseNet(Telugu)

Sreekavitha Parupalli, Vijjini Anvesh Rao, Radhika Mamidi

arXiv:1807.01677v10.2

Originality Synthesis-oriented

AI Analysis

This work incrementally enriches a manually developed Telugu lexicon to support further research in natural language processing for the Telugu language.

The paper tackled the problem of automating sense-type identification for Telugu verbs in OntoSenseNet by applying classifiers, with results showing effectiveness using SVM and Adaboost ensemble methods.

In this paper, we discuss the enrichment of a manually developed resource of Telugu lexicon, OntoSenseNet. OntoSenseNet is a ontological sense annotated lexicon that marks each verb of Telugu with a primary and a secondary sense. The area of research is relatively recent but has a large scope of development. We provide an introductory work to enrich the OntoSenseNet to promote further research in Telugu. Classifiers are adopted to learn the sense relevant features of the words in the resource and also to automate the tagging of sense-types for verbs. We perform a comparative analysis of different classifiers applied on OntoSenseNet. The results of the experiment prove that automated enrichment of the resource is effective using SVM classifiers and Adaboost ensemble.

View on arXiv PDF

Similar