CLDec 18, 2023

Entity or Relation Embeddings? An Analysis of Encoding Strategies for Relation Extraction

arXiv:2312.11062v222 citationsh-index: 31EMNLP
Originality Incremental advance
AI Analysis

This work addresses the problem of improving relation extraction accuracy for natural language processing applications, offering a novel hybrid approach with broad impact.

The paper tackles the challenge of relation extraction by proposing a method that combines entity embeddings with relation embeddings derived from a masked prompt, resulting in a simple model that outperforms state-of-the-art across several benchmarks.

Relation extraction is essentially a text classification problem, which can be tackled by fine-tuning a pre-trained language model (LM). However, a key challenge arises from the fact that relation extraction cannot straightforwardly be reduced to sequence or token classification. Existing approaches therefore solve the problem in an indirect way: they fine-tune an LM to learn embeddings of the head and tail entities, and then predict the relationship from these entity embeddings. Our hypothesis in this paper is that relation extraction models can be improved by capturing relationships in a more direct way. In particular, we experiment with appending a prompt with a [MASK] token, whose contextualised representation is treated as a relation embedding. While, on its own, this strategy significantly underperforms the aforementioned approach, we find that the resulting relation embeddings are highly complementary to what is captured by embeddings of the head and tail entity. By jointly considering both types of representations, we end up with a simple model that outperforms the state-of-the-art across several relation extraction benchmarks.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes