SELGFeb 2, 2025

More Rigorous Software Engineering Would Improve Reproducibility in Machine Learning Research

arXiv:2502.00902v22 citationsh-index: 3
AI Analysis

This addresses reproducibility issues in the ML community, which is an incremental improvement in research practices.

The paper surveyed software best practices in machine learning research repositories from major conferences and journals, identifying gaps and providing recommendations to improve reproducibility.

While experimental reproduction remains a pillar of the scientific method, we observe that the software best practices supporting the reproduction of machine learning ( ML ) research are often undervalued or overlooked, leading both to poor reproducibility and damage to trust in the ML community. We quantify these concerns by surveying the usage of software best practices in software repositories associated with publications at major ML conferences and journals such as NeurIPS, ICML, ICLR, TMLR, and MLOSS within the last decade. We report the results of this survey that identify areas where software best practices are lacking and areas with potential for growth in the ML community. Finally, we discuss the implications and present concrete recommendations on how we, as a community, can improve reproducibility in ML research.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes