CLJun 28, 2022

Few-Shot Fine-Grained Entity Typing with Automatic Label Interpretation and Instance Generation

arXiv:2206.13746v122 citationsh-index: 28Has Code
Originality Incremental advance
AI Analysis

This work addresses the problem of fine-grained entity typing with limited labeled data for natural language processing applications, representing an incremental advance over existing prompt-based methods.

The paper tackles few-shot fine-grained entity typing by proposing a framework that automatically interprets entity type labels using few-shot instances and label hierarchy, and generates new instances to enlarge the training set, achieving significant performance improvements on three benchmark datasets.

We study the problem of few-shot Fine-grained Entity Typing (FET), where only a few annotated entity mentions with contexts are given for each entity type. Recently, prompt-based tuning has demonstrated superior performance to standard fine-tuning in few-shot scenarios by formulating the entity type classification task as a ''fill-in-the-blank'' problem. This allows effective utilization of the strong language modeling capability of Pre-trained Language Models (PLMs). Despite the success of current prompt-based tuning approaches, two major challenges remain: (1) the verbalizer in prompts is either manually designed or constructed from external knowledge bases, without considering the target corpus and label hierarchy information, and (2) current approaches mainly utilize the representation power of PLMs, but have not explored their generation power acquired through extensive general-domain pre-training. In this work, we propose a novel framework for few-shot FET consisting of two modules: (1) an entity type label interpretation module automatically learns to relate type labels to the vocabulary by jointly leveraging few-shot instances and the label hierarchy, and (2) a type-based contextualized instance generator produces new instances based on given instances to enlarge the training set for better generalization. On three benchmark datasets, our model outperforms existing methods by significant margins. Code can be found at https://github.com/teapot123/Fine-Grained-Entity-Typing.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes