CYSEApr 9, 2021

Agile (data) science: a (draft) manifesto

arXiv:2104.12545v31 citations
Originality Synthesis-oriented
AI Analysis

This addresses reproducibility and efficiency issues in academic science, but it is incremental as it proposes applying existing industrial methods to academia.

The paper argues that academia should adopt agile data science practices to address data and project management problems, aiming for more responsible and reproducible science.

Science has a data management problem, as well as a project management problem. While industrial-grade data science teams have embraced the agile mindset, and adopted or created all kind of tools to create reproducible workflows, academia-based science is still (mostly) mired in a mindset that is focused on a single final product (a paper), without focusing on incremental improvement, on any specific problem or customer, or, paying any attention reproducibility. In this report we argue towards the adoption of the agile mindset and agile data science tools in academia, to make a more responsible, and over all, reproducible science.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes