SEDCMar 27

Efficiently Reproducing Distributed Workflows in Notebook-based Systems

arXiv:2603.2696517.4h-index: 33
AI Analysis

This work addresses the challenge of reproducing distributed workflows in notebook-based systems, which is a problem for data scientists and researchers who rely on iterative development and sharing of such workflows.

NBRewind introduces a notebook kernel system that enables efficient and reproducible execution of distributed workflows by using incremental, cell-level checkpointing and partial re-execution, achieving minimal overhead and cross-site reproducibility on HPC systems.

Notebooks provide an author-friendly environment for iterative development, modular execution, and easy sharing. Distributed workflows are increasingly being authored and executed in notebooks, yet sharing and reproducing them remains challenging. Even small code or parameter changes often force full end-to-end re-execution of the distributed workflow, limiting iterative development for such workloads. Current methods for improving notebook execution operate on single-node workflows, while optimization techniques for distributed workflows typically sacrifice reproducibility. We introduce NBRewind, a notebook kernel system for efficient, reproducible execution of distributed workflows in notebooks. NBRewind consists of two kernels--audit and repeat. The audit kernel performs incremental, cell-level checkpointing to avoid unnecessary re-runs; repeat reconstructs checkpoints and enables partial re-execution including notebook cells that manage distributed workflow. Both kernel methods are based on data-flow analysis across cells. We show how checkpoints and logs when packaged as part of standardized notebook specification improve sharing and reproducibility. Using real-world case studies we show that creating incremental checkpoints adds minimal overhead and enables portable, cross-site reproducibility of notebook-based distributed workflows on HPC systems.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes