DCLGDec 13, 2023

Venn: Resource Management for Collaborative Learning Jobs

arXiv:2312.08298v22 citationsh-index: 40Has CodeMLSys
Originality Incremental advance
AI Analysis

This addresses resource contention issues in collaborative learning deployments, offering a practical solution for improving efficiency in distributed ML systems, though it is incremental in nature.

The paper tackles the problem of efficient resource scheduling for collaborative learning jobs across distributed edge devices, proposing Venn, a resource manager that reduces average job completion time by up to 1.88x compared to state-of-the-art methods.

In recent years, collaborative learning (CL) has emerged as a promising approach for machine learning (ML) and data science across distributed edge devices. As the deployment of CL jobs increases, they inevitably contend for limited resources. However, efficient resource scheduling in this context is challenging because of the ephemeral nature and resource heterogeneity of devices, coupled with the overlapping resource requirements of diverse CL jobs. Existing resource managers often assign devices to CL jobs randomly for simplicity and scalability, but this approach compromises job efficiency. In this paper, we present Venn, a CL resource manager that efficiently schedules ephemeral, heterogeneous devices among multiple CL jobs to reduce the average job completion time (JCT). Venn formulates the Intersection Resource Scheduling (IRS) problem to identify complex resource contention among multiple CL jobs. It then proposes a contention-aware scheduling heuristic to minimize the average scheduling delay. Furthermore, it proposes a resource-aware device-to-job matching heuristic to optimize response collection time by mitigating stragglers. Our evaluation shows that, compared to the state-of-the-art CL resource managers, Venn improves the average JCT by up to 1.88x. The code is available at https://github.com/SymbioticLab/Venn.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes