AI CL DBNov 23, 2018

Competency Questions and SPARQL-OWL Queries Dataset and Analysis

Dawid Wisniewski, Jedrzej Potoniec, Agnieszka Lawrynowicz, C. Maria Keet

arXiv:1811.09529v15.67 citationsHas Code

Originality Synthesis-oriented

AI Analysis

This work addresses engineering shortcomings in ontology development for researchers and practitioners, but it is incremental as it builds on existing methodologies by providing a dataset and analysis.

The paper tackled the limited publication and formalization of Competency Questions (CQs) in ontology development by gathering and analyzing a dataset of 234 CQs and their SPARQL-OWL translations, resulting in a 5-fold increase to 106 distinct CQ patterns and revealing relationships between these patterns and 46 query signatures.

Competency Questions (CQs) are natural language questions outlining and constraining the scope of knowledge represented by an ontology. Despite that CQs are a part of several ontology engineering methodologies, we have observed that the actual publication of CQs for the available ontologies is very limited and even scarcer is the publication of their respective formalisations in terms of, e.g., SPARQL queries. This paper aims to contribute to addressing the engineering shortcomings of using CQs in ontology development, to facilitate wider use of CQs. In order to understand the relation between CQs and the queries over the ontology to test the CQs on an ontology, we gather, analyse, and publicly release a set of 234 CQs and their translations to SPARQL-OWL for several ontologies in different domains developed by different groups. We analysed the CQs in two principal ways. The first stage focused on a linguistic analysis of the natural language text itself, i.e., a lexico-syntactic analysis without any presuppositions of ontology elements, and a subsequent step of semantic analysis in order to find patterns. This increased diversity of CQ sources resulted in a 5-fold increase of hitherto published patterns, to 106 distinct CQ patterns, which have a limited subset of few patterns shared across the CQ sets from the different ontologies. Next, we analysed the relation between the found CQ patterns and the 46 SPARQL-OWL query signatures, which revealed that one CQ pattern may be realised by more than one SPARQL-OWL query signature, and vice versa. We hope that our work will contribute to establishing common practices, templates, automation, and user tools that will support CQ formulation, formalisation, execution, and general management.

View on arXiv PDF Code

Similar