MALGDec 12, 2025

Multi-Objective Reinforcement Learning for Large-Scale Mixed Traffic Control

arXiv:2512.11247v1h-index: 5
Originality Incremental advance
AI Analysis

This addresses equitable service in mixed-autonomy traffic systems, offering incremental improvements through a hierarchical framework.

The paper tackles the problem of mixed traffic control by balancing efficiency, fairness, and safety, resulting in up to 53% reductions in average wait time, up to 86% reductions in maximum starvation, and up to 86% reductions in conflict rate compared to baselines.

Effective mixed traffic control requires balancing efficiency, fairness, and safety. Existing approaches excel at optimizing efficiency and enforcing safety constraints but lack mechanisms to ensure equitable service, resulting in systematic starvation of vehicles on low-demand approaches. We propose a hierarchical framework combining multi-objective reinforcement learning for local intersection control with strategic routing for network-level coordination. Our approach introduces a Conflict Threat Vector that provides agents with explicit risk signals for proactive conflict avoidance, and a queue parity penalty that ensures equitable service across all traffic streams. Extensive experiments on a real-world network across different robot vehicle (RV) penetration rates demonstrate substantial improvements: up to 53% reductions in average wait time, up to 86% reductions in maximum starvation, and up to 86\% reduction in conflict rate compared to baselines, while maintaining fuel efficiency. Our analysis reveals that strategic routing effectiveness scales with RV penetration, becoming increasingly valuable at higher autonomy levels. The results demonstrate that multi-objective optimization through well-curated reward functions paired with strategic RV routing yields significant benefits in fairness and safety metrics critical for equitable mixed-autonomy deployment.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes