LGMSSEJul 26, 2021

MLDev: Data Science Experiment Automation and Reproducibility Software

arXiv:2107.12322v14 citationsHas Code
Originality Synthesis-oriented
AI Analysis

This addresses the problem of experiment automation and reproducibility for data scientists, but appears incremental as it builds on existing open-source tools.

The paper tackles the challenge of automating experiments in data science by proposing an extensible experiment model and implementing it in the MLDev software package, with evaluation showing promising results and novelty compared to state-of-the-art tools.

In this paper we explore the challenges of automating experiments in data science. We propose an extensible experiment model as a foundation for integration of different open source tools for running research experiments. We implement our approach in a prototype open source MLDev software package and evaluate it in a series of experiments yielding promising results. Comparison with other state-of-the-art tools signifies novelty of our approach.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes