AINov 21, 2023

Towards a Gateway for Knowledge Graph Schemas Collection, Analysis, and Embedding

arXiv:2311.12465v12 citationsh-index: 58
Originality Synthesis-oriented
AI Analysis

This addresses a data accessibility and processing problem for researchers working with knowledge graphs, though it appears incremental as it builds on existing catalogs.

The paper tackles the difficulty of finding and manipulating knowledge graph data for training statistical models by introducing LiveSchema, a gateway that aggregates 1000 datasets from 4 sources and provides facilities for querying, transforming into matrices, and generating models.

One of the significant barriers to the training of statistical models on knowledge graphs is the difficulty that scientists have in finding the best input data to address their prediction goal. In addition to this, a key challenge is to determine how to manipulate these relational data, which are often in the form of particular triples (i.e., subject, predicate, object), to enable the learning process. Currently, many high-quality catalogs of knowledge graphs, are available. However, their primary goal is the re-usability of these resources, and their interconnection, in the context of the Semantic Web. This paper describes the LiveSchema initiative, namely, a first version of a gateway that has the main scope of leveraging the gold mine of data collected by many existing catalogs collecting relational data like ontologies and knowledge graphs. At the current state, LiveSchema contains - 1000 datasets from 4 main sources and offers some key facilities, which allow to: i) evolving LiveSchema, by aggregating other source catalogs and repositories as input sources; ii) querying all the collected resources; iii) transforming each given dataset into formal concept analysis matrices that enable analysis and visualization services; iv) generating models and tensors from each given dataset.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes