CLDec 20, 2019

SberQuAD -- Russian Reading Comprehension Dataset: Description and Analysis

Pavel Efimov, Andrey Chertok, Leonid Boytsov, Pavel Braslavski

arXiv:1912.09723v365 citations

Originality Synthesis-oriented

AI Analysis

This addresses the lack of a properly presented Russian dataset for the NLP community, though it is incremental as it adapts an existing framework.

The authors introduced SberQuAD, a large-scale Russian reading comprehension dataset analogous to Stanford SQuAD, and provided its description, analysis, and baseline results.

SberQuAD -- a large scale analog of Stanford SQuAD in the Russian language - is a valuable resource that has not been properly presented to the scientific community. We fill this gap by providing a description, a thorough analysis, and baseline experimental results.

View on arXiv PDF

Similar