AILGMANov 10, 2024

OffLight: An Offline Multi-Agent Reinforcement Learning Framework for Traffic Signal Control

arXiv:2411.06601v32 citationsh-index: 42025 IEEE 21st International Conference on Automation Science and Engineering (CASE)
Originality Incremental advance
AI Analysis

This work addresses the problem of costly online training for urban traffic management by providing a scalable offline framework, though it is incremental in improving existing offline RL methods.

The paper tackles the challenge of offline multi-agent reinforcement learning for traffic signal control by handling heterogeneous behavior policies in real-world datasets, achieving up to a 7.8% reduction in average travel time and 11.2% decrease in queue length.

Efficient traffic control (TSC) is essential for urban mobility, but traditional systems struggle to handle the complexity of real-world traffic. Multi-agent Reinforcement Learning (MARL) offers adaptive solutions, but online MARL requires extensive interactions with the environment, making it costly and impractical. Offline MARL mitigates these challenges by using historical traffic data for training but faces significant difficulties with heterogeneous behavior policies in real-world datasets, where mixed-quality data complicates learning. We introduce OffLight, a novel offline MARL framework designed to handle heterogeneous behavior policies in TSC datasets. To improve learning efficiency, OffLight incorporates Importance Sampling (IS) to correct for distributional shifts and Return-Based Prioritized Sampling (RBPS) to focus on high-quality experiences. OffLight utilizes a Gaussian Mixture Variational Graph Autoencoder (GMM-VGAE) to capture the diverse distribution of behavior policies from local observations. Extensive experiments across real-world urban traffic scenarios show that OffLight outperforms existing offline RL methods, achieving up to a 7.8% reduction in average travel time and 11.2% decrease in queue length. Ablation studies confirm the effectiveness of OffLight's components in handling heterogeneous data and improving policy performance. These results highlight OffLight's scalability and potential to improve urban traffic management without the risks of online learning.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes