DL, CL, IR · Mar 23, 2016

CONDITOR1: Topic Maps and DITA labelling tool for textual documents with historical information

arXiv:1603.07313v1
Originality: Synthesis-oriented
AI Analysis

This work addresses document management and retrieval challenges for historians and archivists, though it appears incremental as it builds on existing models and databases.

The researchers developed CONDITOR1, a tool for labeling historical documents with topic maps and DITA models, achieving correct entity identification and improving information retrieval accuracy through integration with an object-oriented database and Lucene search.

Conditor is a software tool that works with textual documents containing historical information. The purpose of this work is two-fold: first, to show that the developed engine correctly identifies and labels the entities of the universe of discourse using a combined XTM-DITA labelling model; second, to explain the improvements achieved in the information retrieval process through the use of an object-oriented database (JPOX) and its integration into the Lucene-type database search process, which not only yields more accurate searches but also supports the future development of a recommender system. We finish with a brief demo, in a 3D graph, of the results of the aforementioned search.

