CL · Jul 2, 2020

Project PIAF: Building a Native French Question-Answering Dataset

arXiv:2007.00968v11000 citations
AI Analysis

This paper addresses the scarcity of French question-answering data, providing a resource for researchers and practitioners. The contribution is incremental, however, since it extends existing data collection methods to a new language.

The authors tackled the lack of non-English data for question answering by creating a native French dataset through a participatory effort, and they released the annotation tool, data, and preliminary baselines.

Motivated by the lack of data for non-English languages, in particular for the evaluation of downstream tasks such as Question Answering, we present a participatory effort to collect a native French Question Answering Dataset. Furthermore, we describe and publicly release the annotation tool developed for our collection effort, along with the data obtained and preliminary baselines.

Code Implementations: 1 repo
