CLAILGOct 23, 2023

What do Deck Chairs and Sun Hats Have in Common? Uncovering Shared Properties in Large Concept Vocabularies

arXiv:2310.14793v1132 citationsh-index: 31
Originality Synthesis-oriented
AI Analysis

This work addresses a domain-specific issue in natural language processing for tasks like entity typing, offering an incremental improvement.

The paper tackles the problem of concept embeddings primarily capturing taxonomic structure by proposing a strategy to identify shared properties among concepts, which is then applied to ultra-fine entity typing to improve state-of-the-art model performance.

Concepts play a central role in many applications. This includes settings where concepts have to be modelled in the absence of sentence context. Previous work has therefore focused on distilling decontextualised concept embeddings from language models. But concepts can be modelled from different perspectives, whereas concept embeddings typically mostly capture taxonomic structure. To address this issue, we propose a strategy for identifying what different concepts, from a potentially large concept vocabulary, have in common with others. We then represent concepts in terms of the properties they share with the other concepts. To demonstrate the practical usefulness of this way of modelling concepts, we consider the task of ultra-fine entity typing, which is a challenging multi-label classification problem. We show that by augmenting the label set with shared properties, we can improve the performance of the state-of-the-art models for this task.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes