MALGJan 22, 2025

An Offline Multi-Agent Reinforcement Learning Framework for Radio Resource Management

arXiv:2501.12991v15 citationsh-index: 9IEEE Trans Mob Comput
Originality Incremental advance
AI Analysis

This work addresses resource management in dynamic wireless networks, offering a scalable and efficient solution, though it is incremental as it applies existing offline MARL paradigms to a specific domain.

The paper tackled radio resource management by optimizing scheduling policies for multiple access points using offline multi-agent reinforcement learning, achieving over a 15% improvement in a weighted combination of sum and tail rates compared to conventional baselines.

Offline multi-agent reinforcement learning (MARL) addresses key limitations of online MARL, such as safety concerns, expensive data collection, extended training intervals, and high signaling overhead caused by online interactions with the environment. In this work, we propose an offline MARL algorithm for radio resource management (RRM), focusing on optimizing scheduling policies for multiple access points (APs) to jointly maximize the sum and tail rates of user equipment (UEs). We evaluate three training paradigms: centralized, independent, and centralized training with decentralized execution (CTDE). Our simulation results demonstrate that the proposed offline MARL framework outperforms conventional baseline approaches, achieving over a 15\% improvement in a weighted combination of sum and tail rates. Additionally, the CTDE framework strikes an effective balance, reducing the computational complexity of centralized methods while addressing the inefficiencies of independent training. These results underscore the potential of offline MARL to deliver scalable, robust, and efficient solutions for resource management in dynamic wireless networks.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes