AIJul 16, 2025

Topology Enhanced MARL for Multi-Vehicle Cooperative Decision-Making of CAVs

Ye Han, Lijun Zhang, Dejian Meng, Zhuang Zhang

arXiv:2507.12110v15.81 citationsh-index: 12Has Code

Originality Incremental advance

AI Analysis

This work addresses cooperative decision-making for CAVs in mixed traffic, offering incremental improvements over existing MARL methods.

The paper tackles the exploration-exploitation challenge in multi-agent reinforcement learning for connected autonomous vehicles by proposing a topology-enhanced MARL method, which improves traffic efficiency, safety, and decision smoothness in simulations across varying conditions.

The exploration-exploitation trade-off constitutes one of the fundamental challenges in reinforcement learning (RL), which is exacerbated in multi-agent reinforcement learning (MARL) due to the exponential growth of joint state-action spaces. This paper proposes a topology-enhanced MARL (TPE-MARL) method for optimizing cooperative decision-making of connected and autonomous vehicles (CAVs) in mixed traffic. This work presents two primary contributions: First, we construct a game topology tensor for dynamic traffic flow, effectively compressing high-dimensional traffic state information and decrease the search space for MARL algorithms. Second, building upon the designed game topology tensor and using QMIX as the backbone RL algorithm, we establish a topology-enhanced MARL framework incorporating visit counts and agent mutual information. Extensive simulations across varying traffic densities and CAV penetration rates demonstrate the effectiveness of TPE-MARL. Evaluations encompassing training dynamics, exploration patterns, macroscopic traffic performance metrics, and microscopic vehicle behaviors reveal that TPE-MARL successfully balances exploration and exploitation. Consequently, it exhibits superior performance in terms of traffic efficiency, safety, decision smoothness, and task completion. Furthermore, the algorithm demonstrates decision-making rationality comparable to or exceeding that of human drivers in both mixed-autonomy and fully autonomous traffic scenarios. Code of our work is available at \href{https://github.com/leoPub/tpemarl}{https://github.com/leoPub/tpemarl}.

View on arXiv PDF Code

Similar