CLApr 13, 2022

Multilingual Event Linking to Wikidata

CMU
arXiv:2204.06535v3630 citationsh-index: 39
Originality Synthesis-oriented
AI Analysis

This work addresses the problem of linking events across languages for knowledge base applications, but it is incremental as it adapts existing methods to a new dataset.

The authors tackled the problem of multilingual event linking to Wikidata by automatically compiling a large-scale dataset with 1.8M mentions across 44 languages and 10.9K events, and found that biencoder and crossencoder models significantly outperformed a BM25+ baseline, with the crosslingual task being more challenging.

We present a task of multilingual linking of events to a knowledge base. We automatically compile a large-scale dataset for this task, comprising of 1.8M mentions across 44 languages referring to over 10.9K events from Wikidata. We propose two variants of the event linking task: 1) multilingual, where event descriptions are from the same language as the mention, and 2) crosslingual, where all event descriptions are in English. On the two proposed tasks, we compare multiple event linking systems including BM25+ (Lv and Zhai, 2011) and multilingual adaptations of the biencoder and crossencoder architectures from BLINK (Wu et al., 2020). In our experiments on the two task variants, we find both biencoder and crossencoder models significantly outperform the BM25+ baseline. Our results also indicate that the crosslingual task is in general more challenging than the multilingual task. To test the out-of-domain generalization of the proposed linking systems, we additionally create a Wikinews-based evaluation set. We present qualitative analysis highlighting various aspects captured by the proposed dataset, including the need for temporal reasoning over context and tackling diverse event descriptions across languages.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes