CLAIOct 27, 2021

Connect-the-Dots: Bridging Semantics between Words and Definitions via Aligning Word Sense Inventories

arXiv:2110.14091v1663 citations
Originality Incremental advance
AI Analysis

This addresses the challenge of limited training data for rare word senses in WSD, enabling better generalization across different sense inventories, though it is an incremental improvement over existing methods.

The paper tackles the data sparsity problem in Word Sense Disambiguation (WSD) by aligning definition sentences from different sense inventories to collect rich lexical knowledge, resulting in improved predictions on both frequent and rare word senses with gains of 1.2% on the All-Words WSD Task and 4.3% on the Low-Shot WSD Task.

Word Sense Disambiguation (WSD) aims to automatically identify the exact meaning of one word according to its context. Existing supervised models struggle to make correct predictions on rare word senses due to limited training data and can only select the best definition sentence from one predefined word sense inventory (e.g., WordNet). To address the data sparsity problem and generalize the model to be independent of one predefined inventory, we propose a gloss alignment algorithm that can align definition sentences (glosses) with the same meaning from different sense inventories to collect rich lexical knowledge. We then train a model to identify semantic equivalence between a target word in context and one of its glosses using these aligned inventories, which exhibits strong transfer capability to many WSD tasks. Experiments on benchmark datasets show that the proposed method improves predictions on both frequent and rare word senses, outperforming prior work by 1.2% on the All-Words WSD Task and 4.3% on the Low-Shot WSD Task. Evaluation on WiC Task also indicates that our method can better capture word meanings in context.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes