LGAINov 8, 2025

SymLight: Exploring Interpretable and Deployable Symbolic Policies for Traffic Signal Control

arXiv:2511.05790v11 citationsh-index: 13
Originality Incremental advance
AI Analysis

This addresses the need for transparent and efficient traffic signal control policies for urban management, though it is incremental as it builds on existing symbolic and reinforcement learning approaches.

The authors tackled the problem of traffic signal control by developing SymLight, a framework that discovers interpretable and deployable symbolic policies, achieving superior performance compared to baselines on real-world datasets.

Deep Reinforcement Learning have achieved significant success in automatically devising effective traffic signal control (TSC) policies. Neural policies, however, tend to be over-parameterized and non-transparent, hindering their interpretability and deployability on resource-limited edge devices. This work presents SymLight, a priority function search framework based on Monte Carlo Tree Search (MCTS) for discovering inherently interpretable and deployable symbolic priority functions to serve as the TSC policies. The priority function, in particular, accepts traffic features as input and then outputs a priority for each traffic signal phase, which subsequently directs the phase transition. For effective search, we propose a concise yet expressive priority function representation. This helps mitigate the combinatorial explosion of the action space in MCTS. Additionally, a probabilistic structural rollout strategy is introduced to leverage structural patterns from previously discovered high-quality priority functions, guiding the rollout process. Our experiments on real-world datasets demonstrate SymLight's superior performance across a range of baselines. A key advantage is SymLight's ability to produce interpretable and deployable TSC policies while maintaining excellent performance.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes