IRJan 14, 2022

The Lokahi Prototype: Toward the automatic Extraction of Entity Relationship Models from Text

arXiv:2201.05327v13 citations
Originality Synthesis-oriented
AI Analysis

This work addresses the challenge of automating entity relationship extraction from text, but it is incremental as it builds on existing methods and focuses on exploratory research.

The paper tackles the problem of automatically generating semantic data models from text by extracting entities and relationships, presenting the Lokahi prototype which uses TF*IDF for entity extraction and co-occurrence statistics for relationship generation.

Entity relationship extraction envisions the automatic generation of semantic data models from collections of text, by automatic recognition of entities, by association of entities to form relationships, and by classifying these instances to assign them to entity sets (or classes) and relationship sets (or associations). As a first step in this direction, the Lokahi prototype can extract entities based on the TF*IDF measure, and generate semantic relationships based on document-level co-occurrence statistics, for example with likelihood ratios and pointwise mutual information. This paper presents results of an explorative, prototypical, qualitative and synthetic research, summarizes insights from two research projects and, based on this, indicates an outline for further research in the field of entity relationship extraction from text.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes