LGMar 1, 2025

Scalable Reinforcement Learning for Virtual Machine Scheduling

arXiv:2503.00537v1h-index: 12Has Code
Originality Incremental advance
AI Analysis

This addresses scalability limitations in cloud infrastructure optimization, representing a significant leap but is incremental in extending RL to larger clusters.

The paper tackles the problem of scaling reinforcement learning for virtual machine scheduling in large-scale cloud computing, achieving applicability to environments with up to 50 physical machines, surpassing state-of-the-art methods in generalization across various scenarios.

Recent advancements in reinforcement learning (RL) have shown promise for optimizing virtual machine scheduling (VMS) in small-scale clusters. The utilization of RL to large-scale cloud computing scenarios remains notably constrained. This paper introduces a scalable RL framework, called Cluster Value Decomposition Reinforcement Learning (CVD-RL), to surmount the scalability hurdles inherent in large-scale VMS. The CVD-RL framework innovatively combines a decomposition operator with a look-ahead operator to adeptly manage representation complexities, while complemented by a Top-$k$ filter operator that refines exploration efficiency. Different from existing approaches limited to clusters of $10$ or fewer physical machines (PMs), CVD-RL extends its applicability to environments encompassing up to $50$ PMs. Furthermore, the CVD-RL framework demonstrates generalization capabilities that surpass contemporary SOTA methodologies across a variety of scenarios in empirical studies. This breakthrough not only showcases the framework's exceptional scalability and performance but also represents a significant leap in the application of RL for VMS within complex, large-scale cloud infrastructures. The code is available at https://anonymous.4open.science/r/marl4sche-D0FE.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes