DC AIJul 1, 2024

Reinforcement Learning-driven Data-intensive Workflow Scheduling for Volunteer Edge-Cloud

Motahare Mounesan, Mauro Lemus, Hemanth Yeddulapalli, Prasad Calyam, Saptarshi Debroy

arXiv:2407.01428v15.912 citationsh-index: 13

Originality Incremental advance

AI Analysis

This work addresses scheduling inefficiencies for data-intensive workflows in distributed, heterogeneous volunteer computing systems, representing an incremental improvement over existing methods.

The paper tackles the challenge of scheduling data-intensive scientific workflows in Volunteer Edge-Cloud (VEC) environments by proposing a Reinforcement Learning-driven approach that optimizes long-term average performance, demonstrating benefits over baseline strategies in workflow and resource satisfaction.

In recent times, Volunteer Edge-Cloud (VEC) has gained traction as a cost-effective, community computing paradigm to support data-intensive scientific workflows. However, due to the highly distributed and heterogeneous nature of VEC resources, centralized workflow task scheduling remains a challenge. In this paper, we propose a Reinforcement Learning (RL)-driven data-intensive scientific workflow scheduling approach that takes into consideration: i) workflow requirements, ii) VEC resources' preference on workflows, and iii) diverse VEC resource policies, to ensure robust resource allocation. We formulate the long-term average performance optimization problem as a Markov Decision Process, which is solved using an event-based Asynchronous Advantage Actor-Critic RL approach. Our extensive simulations and testbed implementations demonstrate our approach's benefits over popular baseline strategies in terms of workflow requirement satisfaction, VEC preference satisfaction, and available VEC resource utilization.

View on arXiv PDF

Similar