CLDec 20, 2019
SberQuAD -- Russian Reading Comprehension Dataset: Description and Analysis
arXiv:1912.09723v365 citations
Originality Synthesis-oriented
AI Analysis
This addresses the lack of a properly presented Russian dataset for the NLP community, though it is incremental as it adapts an existing framework.
The authors introduced SberQuAD, a large-scale Russian reading comprehension dataset analogous to Stanford SQuAD, and provided its description, analysis, and baseline results.
SberQuAD -- a large scale analog of Stanford SQuAD in the Russian language - is a valuable resource that has not been properly presented to the scientific community. We fill this gap by providing a description, a thorough analysis, and baseline experimental results.