CLMar 14, 2021

A Systematic Review of Reproducibility Research in Natural Language Processing

arXiv:2103.07929v2813 citations
AI Analysis

This addresses the reproducibility crisis in NLP for researchers, but it is incremental as it reviews existing work without proposing new solutions.

The paper tackles the lack of consensus on defining, measuring, and addressing reproducibility in Natural Language Processing by providing a comprehensive review of current research, highlighting diversity rather than convergence in approaches.

Against the background of what has been termed a reproducibility crisis in science, the NLP field is becoming increasingly interested in, and conscientious about, the reproducibility of its results. The past few years have seen an impressive range of new initiatives, events and active research in the area. However, the field is far from reaching a consensus about how reproducibility should be defined, measured and addressed, with diversity of views currently increasing rather than converging. With this focused contribution, we aim to provide a wide-angle, and as near as possible complete, snapshot of current work on reproducibility in NLP, delineating differences and similarities, and providing pointers to common denominators.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes