CLOct 11, 2024

Comparative Analysis of Static and Contextual Embeddings for Analyzing Semantic Changes in Medieval Latin Charters

arXiv:2410.09283v11 citationsh-index: 4
Originality Synthesis-oriented
AI Analysis

This provides a computational analysis for historians and linguists studying Medieval Latin, but it is incremental as it confirms existing findings in a new domain.

This paper tackled the problem of analyzing semantic changes in Medieval Latin charters before and after the Norman Conquest of 1066 C.E., using the DEEDS database, and found that contextual embeddings outperformed static word embeddings for this task in a scarce historical corpus.

The Norman Conquest of 1066 C.E. brought profound transformations to England's administrative, societal, and linguistic practices. The DEEDS (Documents of Early England Data Set) database offers a unique opportunity to explore these changes by examining shifts in word meanings within a vast collection of Medieval Latin charters. While computational linguistics typically relies on vector representations of words like static and contextual embeddings to analyze semantic changes, existing embeddings for scarce and historical Medieval Latin are limited and may not be well-suited for this task. This paper presents the first computational analysis of semantic change pre- and post-Norman Conquest and the first systematic comparison of static and contextual embeddings in a scarce historical data set. Our findings confirm that, consistent with existing studies, contextual embeddings outperform static word embeddings in capturing semantic change within a scarce historical corpus.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes