CLJul 2, 2022

MIA 2022 Shared Task: Evaluating Cross-lingual Open-Retrieval Question Answering for 16 Diverse Languages

Akari Asai, Shayne Longpre, Jungo Kasai, Chia-Hsuan Lee, Rui Zhang, Junjie Hu, Ikuya Yamada, Jonathan H. Clark, Eunsol Choi

arXiv:2207.00758v131.9631 citationsh-index: 33

Originality Synthesis-oriented

AI Analysis

This work addresses the problem of evaluating multilingual QA systems for researchers, but it is incremental as it builds on existing datasets and tasks.

The paper tackled cross-lingual open-retrieval question answering for 16 diverse languages by adapting and annotating datasets, with the best system achieving 32.2 F1, a 4.5-point improvement over the baseline.

We present the results of the Workshop on Multilingual Information Access (MIA) 2022 Shared Task, evaluating cross-lingual open-retrieval question answering (QA) systems in 16 typologically diverse languages. In this task, we adapted two large-scale cross-lingual open-retrieval QA datasets in 14 typologically diverse languages, and newly annotated open-retrieval QA data in 2 underrepresented languages: Tagalog and Tamil. Four teams submitted their systems. The best system leveraging iteratively mined diverse negative examples and larger pretrained models achieves 32.2 F1, outperforming our baseline by 4.5 points. The second best system uses entity-aware contextualized representations for document retrieval, and achieves significant improvements in Tamil (20.8 F1), whereas most of the other systems yield nearly zero scores.

View on arXiv PDF

Similar