CLMar 11, 2025

LSC-Eval: A General Framework to Evaluate Methods for Assessing Dimensions of Lexical Semantic Change Using LLM-Generated Synthetic Data

Naomi Baes, Raphaël Merx, Nick Haslam, Ekaterina Vylomova, Haim Dubossarsky

arXiv:2503.08042v26.72 citationsh-index: 15ACL

Originality Incremental advance

AI Analysis

This provides a tool for dimension- and domain-specific benchmarking of LSC methods, particularly useful for social sciences, but it is incremental as it builds on existing frameworks like SIBling.

The paper tackled the lack of benchmark datasets for evaluating lexical semantic change (LSC) methods by proposing LSC-Eval, a framework that generates synthetic data to simulate changes along specific dimensions like sentiment, intensity, and breadth, and found that tailored methods effectively detect these changes while a state-of-the-art model struggles with affective dimensions.

Lexical Semantic Change (LSC) provides insight into cultural and social dynamics. Yet, the validity of methods for measuring different kinds of LSC remains unestablished due to the absence of historical benchmark datasets. To address this gap, we propose LSC-Eval, a novel three-stage general-purpose evaluation framework to: (1) develop a scalable methodology for generating synthetic datasets that simulate theory-driven LSC using In-Context Learning and a lexical database; (2) use these datasets to evaluate the sensitivity of computational methods to synthetic change; and (3) assess their suitability for detecting change in specific dimensions and domains. We apply LSC-Eval to simulate changes along the Sentiment, Intensity, and Breadth (SIB) dimensions, as defined in the SIBling framework, using examples from psychology. We then evaluate the ability of selected methods to detect these controlled interventions. Our findings validate the use of synthetic benchmarks, demonstrate that tailored methods effectively detect changes along SIB dimensions, and reveal that a state-of-the-art LSC model faces challenges in detecting affective dimensions of LSC. LSC-Eval offers a valuable tool for dimension- and domain-specific benchmarking of LSC methods, with particular relevance to the social sciences.

View on arXiv PDF

Similar