DCApr 29

DMRlib: Easy-coding and Efficient Resource Management for Job Malleability

arXiv:2604.2662438.526 citations
Predicted impact top 42% in DC · last 90 daysOriginality Incremental advance
AI Analysis

For scientific programmers, DMRlib reduces the development effort for process malleability, which improves resource utilization in data centers.

DMRlib is a library that simplifies the development of malleable MPI applications with a minimalist syntax. Experiments show it can improve global throughput by over 3x compared to non-malleable jobs.

Process malleability has proved to have a highly positive impact on the resource utilization and global productivity in data centers compared with the conventional static resource allocation policy. However, the non-negligible additional development effort this solution imposes has constrained its adoption by the scientific programming community. In this work, we present DMRlib, a library designed to offer the global advantages of process malleability while providing a minimalist MPI-like syntax. The library includes a series of predefined communication patterns that greatly ease the development of malleable applications. In addition, we deploy several scenarios to demonstrate the positive impact of process malleability featuring different scalability patterns. Concretely, we study two job submission modes (rigid and moldable) in order to identify the best-case scenarios for malleability using metrics such as resource allocation rate, completed jobs per second, and energy consumption. The experiments prove that our elastic approach may improve global throughput by a factor higher than 3x compared to the traditional workloads of non-malleable jobs.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes