LGAIDCSESep 22, 2025

Cluster Workload Allocation: A Predictive Approach Leveraging Machine Learning Efficiency

arXiv:2509.17695v14 citationsh-index: 5IEEE Access
Originality Synthesis-oriented
AI Analysis

This addresses cluster efficiency for data center operators, but it is incremental as it applies existing ML methods to a specific workload allocation problem.

The study tackled workload allocation in clusters by using machine learning to predict suitable node-task pairings for tasks with node affinity constraints, achieving 98% accuracy and a 1.5-1.8% misclassification rate for tasks with a single suitable node.

This research investigates how Machine Learning (ML) algorithms can assist in workload allocation strategies by detecting tasks with node affinity operators (referred to as constraint operators), which constrain their execution to a limited number of nodes. Using real-world Google Cluster Data (GCD) workload traces and the AGOCS framework, the study extracts node attributes and task constraints, then analyses them to identify suitable node-task pairings. It focuses on tasks that can be executed on either a single node or fewer than a thousand out of 12.5k nodes in the analysed GCD cluster. Task constraint operators are compacted, pre-processed with one-hot encoding, and used as features in a training dataset. Various ML classifiers, including Artificial Neural Networks, K-Nearest Neighbours, Decision Trees, Naive Bayes, Ridge Regression, Adaptive Boosting, and Bagging, are fine-tuned and assessed for accuracy and F1-scores. The final ensemble voting classifier model achieved 98% accuracy and a 1.5-1.8% misclassification rate for tasks with a single suitable node.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes