CLJan 9, 2022

Indian Language Wordnets and their Linkages with Princeton WordNet

arXiv:2201.02977v11088 citations
Originality Synthesis-oriented
AI Analysis

This work addresses a resource gap for NLP in Indian languages by sharing manually created wordnet linkages, though it is incremental as it builds on existing wordnet frameworks.

The authors tackled the scarcity of linked wordnet resources for Indian languages by releasing mappings of 18 Indian language wordnets linked with Princeton WordNet, which provides a manually created gold standard for NLP applications.

Wordnets are rich lexico-semantic resources. Linked wordnets are extensions of wordnets, which link similar concepts in wordnets of different languages. Such resources are extremely useful in many Natural Language Processing (NLP) applications, primarily those based on knowledge-based approaches. In such approaches, these resources are considered as gold standard/oracle. Thus, it is crucial that these resources hold correct information. Thereby, they are created by human experts. However, human experts in multiple languages are hard to come by. Thus, the community would benefit from sharing of such manually created resources. In this paper, we release mappings of 18 Indian language wordnets linked with Princeton WordNet. We believe that availability of such resources will have a direct impact on the progress in NLP for these languages.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes