CL AIFeb 12, 2022

Ultra-fine Entity Typing with Indirect Supervision from Natural Language Inference

arXiv:2202.06167v132.0635 citationsHas Code

Originality Incremental advance

AI Analysis

This work improves entity typing for natural language processing applications by enabling better generalization to rare or unseen types, though it is incremental in its approach.

The paper tackles the problem of ultra-fine entity typing (UFET) by formulating it as a natural language inference (NLI) problem to address data scarcity and limited generalization, achieving state-of-the-art performance on the UFET task and strong results on other benchmarks with unseen types.

The task of ultra-fine entity typing (UFET) seeks to predict diverse and free-form words or phrases that describe the appropriate types of entities mentioned in sentences. A key challenge for this task lies in the large amount of types and the scarcity of annotated data per type. Existing systems formulate the task as a multi-way classification problem and train directly or distantly supervised classifiers. This causes two issues: (i) the classifiers do not capture the type semantics since types are often converted into indices; (ii) systems developed in this way are limited to predicting within a pre-defined type set, and often fall short of generalizing to types that are rarely seen or unseen in training. This work presents LITE, a new approach that formulates entity typing as a natural language inference (NLI) problem, making use of (i) the indirect supervision from NLI to infer type information meaningfully represented as textual hypotheses and alleviate the data scarcity issue, as well as (ii) a learning-to-rank objective to avoid the pre-defining of a type set. Experiments show that, with limited training data, LITE obtains state-of-the-art performance on the UFET task. In addition, LITE demonstrates its strong generalizability, by not only yielding best results on other fine-grained entity typing benchmarks, more importantly, a pre-trained LITE system works well on new data containing unseen types.

View on arXiv PDF Code

Similar