LGDec 27, 2021

Intelligent Traffic Light via Policy-based Deep Reinforcement Learning

arXiv:2112.13817v1
Originality Incremental advance
AI Analysis

This addresses traffic management in smart cities, but it is incremental as it applies an existing reinforcement learning method to a known problem.

The study tackled traffic congestion by using Proximal Policy Optimization (PPO) to train an intelligent traffic light control agent, finding that PPO outperformed DQN and DDQN methods and that variable time intervals improved traffic flow.

Intelligent traffic lights in smart cities can optimally reduce traffic congestion. In this study, we employ reinforcement learning to train the control agent of a traffic light on a simulator of urban mobility. As a difference from existing works, a policy-based deep reinforcement learning method, Proximal Policy Optimization (PPO), is utilized other than value-based methods such as Deep Q Network (DQN) and Double DQN (DDQN). At first, the obtained optimal policy from PPO is compared to those from DQN and DDQN. It is found that the policy from PPO performs better than the others. Next, instead of the fixed-interval traffic light phases, we adopt the light phases with variable time intervals, which result in a better policy to pass the traffic flow. Then, the effects of environment and action disturbances are studied to demonstrate the learning-based controller is robust. At last, we consider unbalanced traffic flows and find that an intelligent traffic light can perform moderately well for the unbalanced traffic scenarios, although it learns the optimal policy from the balanced traffic scenarios only.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes