LGDec 27, 2021

Intelligent Traffic Light via Policy-based Deep Reinforcement Learning

Yue Zhu, Mingyu Cai, Chris Schwarz, Junchao Li, Shaoping Xiao

arXiv:2112.13817v11.6Has Code

Originality Incremental advance

AI Analysis

This addresses traffic management in smart cities, but it is incremental as it applies an existing reinforcement learning method to a known problem.

The study tackled traffic congestion by using Proximal Policy Optimization (PPO) to train an intelligent traffic light control agent, finding that PPO outperformed DQN and DDQN methods and that variable time intervals improved traffic flow.

Intelligent traffic lights in smart cities can optimally reduce traffic congestion. In this study, we employ reinforcement learning to train the control agent of a traffic light on a simulator of urban mobility. As a difference from existing works, a policy-based deep reinforcement learning method, Proximal Policy Optimization (PPO), is utilized other than value-based methods such as Deep Q Network (DQN) and Double DQN (DDQN). At first, the obtained optimal policy from PPO is compared to those from DQN and DDQN. It is found that the policy from PPO performs better than the others. Next, instead of the fixed-interval traffic light phases, we adopt the light phases with variable time intervals, which result in a better policy to pass the traffic flow. Then, the effects of environment and action disturbances are studied to demonstrate the learning-based controller is robust. At last, we consider unbalanced traffic flows and find that an intelligent traffic light can perform moderately well for the unbalanced traffic scenarios, although it learns the optimal policy from the balanced traffic scenarios only.

View on arXiv PDF Code

Similar