CLAug 24, 2021

Robustness Evaluation of Entity Disambiguation Using Prior Probes:the Case of Entity Overshadowing

Vera Provatorova, Svitlana Vakulenko, Samarth Bhargav, Evangelos Kanoulas

arXiv:2108.10949v230.8663 citations

Originality Synthesis-oriented

AI Analysis

This work addresses evaluation biases in entity linking for NLP researchers, but it is incremental as it focuses on benchmarking rather than new methods.

The authors tackled the problem of overestimated entity disambiguation performance due to prior probability bias in existing datasets by introducing the ShadowLink dataset with 16K short text snippets. Their evaluation revealed a significant accuracy gap between common and rare entities across popular systems, demonstrating the impact of entity overshadowing.

Entity disambiguation (ED) is the last step of entity linking (EL), when candidate entities are reranked according to the context they appear in. All datasets for training and evaluating models for EL consist of convenience samples, such as news articles and tweets, that propagate the prior probability bias of the entity distribution towards more frequently occurring entities. It was previously shown that the performance of the EL systems on such datasets is overestimated since it is possible to obtain higher accuracy scores by merely learning the prior. To provide a more adequate evaluation benchmark, we introduce the ShadowLink dataset, which includes 16K short text snippets annotated with entity mentions. We evaluate and report the performance of popular EL systems on the ShadowLink benchmark. The results show a considerable difference in accuracy between more and less common entities for all of the EL systems under evaluation, demonstrating the effects of prior probability bias and entity overshadowing.

View on arXiv PDF

Similar