CLJul 2, 2020

Project PIAF: Building a Native French Question-Answering Dataset

Rachel Keraron, Guillaume Lancrenon, Mathilde Bras, Frédéric Allary, Gilles Moyse, Thomas Scialom, Edmundo-Pavel Soriano-Morales, Jacopo Staiano

arXiv:2007.00968v131.21000 citationsHas Code

Originality Synthesis-oriented

AI Analysis

This addresses the problem of data scarcity for French question answering, providing a resource for researchers and practitioners, though it is incremental as it extends existing data collection methods to a new language.

The authors tackled the lack of non-English data for question answering by creating a native French dataset through a participatory effort, and they released the annotation tool, data, and preliminary baselines.

Motivated by the lack of data for non-English languages, in particular for the evaluation of downstream tasks such as Question Answering, we present a participatory effort to collect a native French Question Answering Dataset. Furthermore, we describe and publicly release the annotation tool developed for our collection effort, along with the data obtained and preliminary baselines.

View on arXiv PDF Code

Similar