CECLJun 29, 2024

SHADE: Semantic Hypernym Annotator for Domain-specific Entities -- DnD Domain Use Case

arXiv:2407.00407v11 citations
Originality Synthesis-oriented
AI Analysis

This addresses the challenge of costly and inconsistent annotation for specialized domains like fantasy literature, though it is incremental as it applies existing annotation methods to a new domain.

The paper tackles the problem of manual data annotation in NLP by introducing SHADE, an annotation software designed for the high fantasy literature domain, specifically Dungeons and Dragons lore from the Forgotten Realms Fandom Wiki, to reduce human error and improve consistency.

Manual data annotation is an important NLP task but one that takes considerable amount of resources and effort. In spite of the costs, labeling and categorizing entities is essential for NLP tasks such as semantic evaluation. Even though annotation can be done by non-experts in most cases, due to the fact that this requires human labor, the process is costly. Another major challenge encountered in data annotation is maintaining the annotation consistency. Annotation efforts are typically carried out by teams of multiple annotators. The annotations need to maintain the consistency in relation to both the domain truth and annotation format while reducing human errors. Annotating a specialized domain that deviates significantly from the general domain, such as fantasy literature, will see a lot of human error and annotator disagreement. So it is vital that proper guidelines and error reduction mechanisms are enforced. One such way to enforce these constraints is using a specialized application. Such an app can ensure that the notations are consistent, and the labels can be pre-defined or restricted reducing the room for errors. In this paper, we present SHADE, an annotation software that can be used to annotate entities in the high fantasy literature domain. Specifically in Dungeons and Dragons lore extracted from the Forgotten Realms Fandom Wiki.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes