HCAIETIRLGJul 22, 2024

The Ontoverse: Democratising Access to Knowledge Graph-based Data Through a Cartographic Interface

arXiv:2408.03339v11 citationsh-index: 10
Originality Synthesis-oriented
AI Analysis

This addresses the challenge for researchers in interdisciplinary fields who need to efficiently explore connected research works, though it appears incremental by combining existing techniques like NLP and topic modelling with a novel visual interface.

The paper tackles the problem of navigating complex scientific literature by developing a cartographic interface that integrates domain knowledge into a knowledge graph, enabling intuitive exploration and discovery. The result is a scalable system demonstrated on a biomedical literature corpus with hundreds of named entities.

As the number of scientific publications and preprints is growing exponentially, several attempts have been made to navigate this complex and increasingly detailed landscape. These have almost exclusively taken unsupervised approaches that fail to incorporate domain knowledge and lack the structural organisation required for intuitive interactive human exploration and discovery. Especially in highly interdisciplinary fields, a deep understanding of the connectedness of research works across topics is essential for generating insights. We have developed a unique approach to data navigation that leans on geographical visualisation and uses hierarchically structured domain knowledge to enable end-users to explore knowledge spaces grounded in their desired domains of interest. This can take advantage of existing ontologies, proprietary intelligence schemata, or be directly derived from the underlying data through hierarchical topic modelling. Our approach uses natural language processing techniques to extract named entities from the underlying data and normalise them against relevant domain references and navigational structures. The knowledge is integrated by first calculating similarities between entities based on their shared extracted feature space and then by alignment to the navigational structures. The result is a knowledge graph that allows for full text and semantic graph query and structured topic driven navigation. This allows end-users to identify entities relevant to their needs and access extensive graph analytics. The user interface facilitates graphical interaction with the underlying knowledge graph and mimics a cartographic map to maximise ease of use and widen adoption. We demonstrate an exemplar project using our generalisable and scalable infrastructure for an academic biomedical literature corpus that is grounded against hundreds of different named domain entities.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes