Competency Questions and SPARQL-OWL Queries Dataset and Analysis

Agnieszka Lawrynowicz; C. Maria Keet; Dawid Wisniewski; Jedrzej Potoniec

arXiv: 1811.09529 · v1 · pith:2S2QLMSCnew · submitted 2018-11-23 · cs.AI · cs.CL · cs.DB

classification cs.AI cs.CL cs.DB

keywords ontology, patterns, analysis, sparql-owl, different, ontologies, queries, questions

Competency Questions (CQs) are natural language questions that outline and constrain the scope of knowledge represented by an ontology. Although CQs are part of several ontology engineering methodologies, we have observed that very few CQs are actually published for the available ontologies, and that their formalisations in terms of, e.g., SPARQL queries are published even more rarely. This paper aims to help address the engineering shortcomings of using CQs in ontology development and thereby facilitate wider use of CQs. To understand the relation between CQs and the queries used to test them on an ontology, we gather, analyse, and publicly release a set of 234 CQs and their translations to SPARQL-OWL for several ontologies in different domains developed by different groups. We analysed the CQs in two principal ways. The first stage was a linguistic analysis of the natural language text itself, i.e., a lexico-syntactic analysis without any presuppositions of ontology elements, followed by a semantic analysis to find patterns. The increased diversity of CQ sources resulted in a 5-fold increase in the number of hitherto published patterns, to 106 distinct CQ patterns, of which only a small subset is shared across the CQ sets from the different ontologies. Next, we analysed the relation between the CQ patterns found and the 46 SPARQL-OWL query signatures, which revealed that one CQ pattern may be realised by more than one SPARQL-OWL query signature, and vice versa. We hope that our work will contribute to establishing common practices, templates, automation, and user tools that will support CQ formulation, formalisation, execution, and general management.
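
The many-to-many relation between CQ patterns and SPARQL-OWL query signatures described in the abstract can be pictured with a minimal Python sketch. The pattern and signature labels, the pair list, and the helper names `build_mappings` and `ambiguous` are all hypothetical placeholders chosen for illustration; none of them comes from the released dataset or from the authors' tooling.

```python
from collections import defaultdict


def build_mappings(pairs):
    """Index (pattern, signature) pairs in both directions."""
    by_pattern = defaultdict(set)
    by_signature = defaultdict(set)
    for pattern, signature in pairs:
        by_pattern[pattern].add(signature)
        by_signature[signature].add(pattern)
    return by_pattern, by_signature


def ambiguous(index):
    """Return, in sorted order, the keys that map to more than one value."""
    return sorted(key for key, values in index.items() if len(values) > 1)


# Hypothetical pairs for illustration only; not taken from the dataset.
EXAMPLE_PAIRS = [
    ("pattern-1", "signature-A"),
    ("pattern-1", "signature-B"),
    ("pattern-2", "signature-B"),
    ("pattern-3", "signature-C"),
]

by_pattern, by_signature = build_mappings(EXAMPLE_PAIRS)

# Patterns realised by more than one query signature.
print(ambiguous(by_pattern))
# Signatures that realise more than one pattern.
print(ambiguous(by_signature))
```

In this toy data, `pattern-1` is realised by two signatures and `signature-B` serves two patterns, which is the kind of two-way ambiguity the abstract reports.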

Forward citations

Cited by 1 Pith paper

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

CLaRO: a Data-driven CNL for Specifying Competency Questions
cs.AI 2019-07 unverdicted novelty 6.0

CLaRO is a data-driven CNL using 93 templates and 41 variants that covers about 90% of unseen competency questions for ontologies, while also flagging invalid questions.