Multilingual Central Repository: a Cross-lingual Framework for Developing Wordnets
This work addresses the problem of resource development for language processing researchers, but it is incremental, since it builds on existing wordnet and ontology frameworks.
The paper tackles the costly and complex process of building language resources by describing the cross-lingual framework used to develop the Multilingual Central Repository (MCR), a multilingual knowledge base that includes wordnets for six languages and several ontologies, with its state and tools presented as of 2017.
Language resources are necessary for language processing, but building them is costly, involves many researchers from different areas, and requires constant updating. In this paper, we describe the cross-lingual framework used for developing the Multilingual Central Repository (MCR), a multilingual knowledge base that includes wordnets for Basque, Catalan, English, Galician, Portuguese and Spanish, together with the following ontologies: Base Concepts, Top Ontology, WordNet Domains and Suggested Upper Merged Ontology. We present the history of the MCR, its state in 2017 and the tools developed for it.