A Learning Method with Gap-Aware Generation for Heterogeneous DAG Scheduling

Ruisong Zhou, Haijun Zou, Li Zhou, Chumin Sun, Zaiwen Wen

arXiv:2603.23249

AI Analysis

This paper addresses scheduling challenges in heterogeneous computing systems, though it appears incremental because it builds on existing neural scheduling approaches.

The paper tackles efficient scheduling of directed acyclic graphs (DAGs) in heterogeneous environments by proposing WeCAN, a reinforcement learning framework that addresses task-pool compatibility and generation-induced optimality gaps. Experiments show improved makespan over baselines with inference time comparable to classical heuristics.

Efficient scheduling of directed acyclic graphs (DAGs) in heterogeneous environments is challenging due to resource capacities and dependencies. In practice, the need for adaptability across environments with varying resource pools and task types, alongside rapid schedule generation, complicates these challenges. We propose WeCAN, an end-to-end reinforcement learning framework for heterogeneous DAG scheduling that addresses task-pool compatibility coefficients and generation-induced optimality gaps. It adopts a two-stage single-pass design: a single forward pass produces task-pool scores and global parameters, followed by a generation map that constructs schedules without repeated network calls. Its weighted cross-attention encoder models task-pool interactions gated by compatibility coefficients, and is size-agnostic to environment fluctuations. Moreover, widely used list-scheduling maps can incur generation-induced optimality gaps from restricted reachability. We introduce an order-space analysis that characterizes the reachable set of generation maps via feasible schedule orders, explains the mechanism behind generation-induced gaps, and yields sufficient conditions for gap elimination. Guided by these conditions, we design a skip-extended realization with an analytically parameterized decreasing skip rule, which enlarges the reachable order set while preserving single-pass efficiency. Experiments on computation graphs and real-world TPC-H DAGs demonstrate improved makespan over strong baselines, with inference time comparable to classical heuristics and faster than multi-round neural schedulers.
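
To make the "scores first, then a generation map" idea concrete, the following is a minimal sketch of a plain greedy list-scheduling map that turns a fixed task-pool score table into a schedule without any further model calls. It is not WeCAN's generation map and does not include the skip-extended rule; it is the kind of baseline map the abstract says can suffer from restricted reachability. The function name `list_schedule`, the data layout, the hand-written scores, and the one-task-per-pool capacity model are all assumptions made for illustration.

```python
def list_schedule(durations, preds, scores):
    """Greedy list-scheduling map (illustrative sketch, not the paper's method).

    durations[t][p]: run time of task t on pool p, or None if incompatible.
    preds[t]: set of tasks that must finish before task t may start.
    scores[t][p]: precomputed task-pool score; the highest-scoring ready pair goes first.
    Assumption: each pool runs one task at a time.
    """
    n_pools = len(durations[0])
    pool_free = [0.0] * n_pools        # time at which each pool becomes idle
    finish = {}                        # task -> finish time
    schedule = []                      # (task, pool, start, end) in decision order
    remaining = set(range(len(durations)))

    while remaining:
        # A task is ready once all of its predecessors have been scheduled.
        ready = [t for t in sorted(remaining)
                 if all(q in finish for q in preds[t])]
        candidates = [(t, p) for t in ready for p in range(n_pools)
                      if durations[t][p] is not None]
        if not candidates:
            raise ValueError("cyclic dependencies or no compatible pool")

        # Pick the ready (task, pool) pair with the highest score.
        t, p = max(candidates, key=lambda tp: scores[tp[0]][tp[1]])
        deps_done = max((finish[q] for q in preds[t]), default=0.0)
        start = max(pool_free[p], deps_done)
        end = start + durations[t][p]

        pool_free[p] = end
        finish[t] = end
        schedule.append((t, p, start, end))
        remaining.remove(t)

    return schedule, max(finish.values())


# Toy instance: task 0 precedes tasks 1 and 2; two pools with different speeds.
durations = [[2, 4], [3, 1], [2, 2]]
preds = [set(), {0}, {0}]
scores = [[1.0, 0.1], [0.2, 0.9], [0.8, 0.3]]   # hand-written stand-in scores

schedule, makespan = list_schedule(durations, preds, scores)
print(makespan)  # 4.0
```

Because the map only ever picks among currently ready pairs and never leaves a chosen pool idle on purpose, some feasible schedule orders cannot be produced no matter what scores are supplied, which is the reachability restriction the abstract refers to.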
