Kanok Boriboonsomsin

h-index42

5papers

2citations

Novelty26%

AI Score43

Ranked #51,482 of 194,257 authors (top 27%)#265 in SY (top 16%)

5 Papers

12.1AIMay 31

Large Language Models in Transportation Systems Management and Operations: From Text Reasoning to Multi-modal Decision Support

Siyan Li, Zehao Wang, Jiachen Li et al.

Transportation systems management and operations (TSMO) increasingly depends on timely interpretation of heterogeneous data, from various sensor streams, incident reports, traveler feedback, and visual observations. Large language models (LLMs), including emerging multi-modal large language models (MM-LLMs), provide a new mechanism for integrating these structured and unstructured inputs into operator-facing decision support. This survey paper reviews LLM- and MM-LLM-based applications in TSMO across three domains: transportation operations & services (supply), mobility & fleet services (demand), and data, modeling & decision support. Using a PRISMA-guided screening process, we synthesize current studies while distinguishing operationally oriented applications from prototype and emerging concepts. We further identify recurring challenges in data heterogeneity, real-time inference, explainability, multi-modal fusion, and governance. Finally, we outline existing gaps and future directions in localized adaptation, edge deployment, benchmarking, and cross-agency collaboration. Overall, LLM-based systems appear most promising as a decision-support layer, with MM-LLMs offering particular value when heterogeneous text, visual, and sensor inputs must be integrated.

8.8SYMay 1

From Sensing to Decision: A Generic Architecture for Freight Signal Priority Systems

Ziyan Zhang, Xuanpeng Zhao, Chuheng Wei et al.

Freight Signal Priority (FSP) systems have emerged as a promising strategy to enhance freight mobility and reduce corridor delays in urban networks. While extensive research has focused on priority control algorithms and operational performance evaluation, comparatively limited attention has been devoted to the architectural design of sensing processes that shape reliable priority decisions. In practice, uncertainties in vehicle detection, communication, and estimated time of arrival (ETA) may propagate within the sensing-to-decision process, affecting priority timing and downstream signal performance. This paper presents a systematic review of FSP systems from a sensing-to-decision perspective. We propose a generic two-layer architecture consisting of a sensing-to-decision layer and a control execution layer. The sensing-to-decision layer transforms sensing inputs into priority decisions, while the control execution layer implements approved actions within traffic controllers. Within this architecture, we systematically compare major sensing modalities, including loop detectors, vision sensors, and V2I, across dimensions such as classification capability, state estimation accuracy, latency, and information richness. We further examine representative FSP systems to analyze how modality-specific characteristics and uncertainties influence ETA computation, priority triggering, and decision reliability. By linking sensing design to decision outcomes, this review identifies key deployment challenges and research gaps in reliability-aware sensing-to-decision design. Ultimately, this work provides a conceptual foundation for developing scalable and robust FSP systems that explicitly account for sensing imperfections rather than assuming idealized inputs.

2.8CVFeb 19

A Multi-modal Detection System for Infrastructure-based Freight Signal Priority

Ziyan Zhang, Chuheng Wei, Xuanpeng Zhao et al.

Freight vehicles approaching signalized intersections require reliable detection and motion estimation to support infrastructure-based Freight Signal Priority (FSP). Accurate and timely perception of vehicle type, position, and speed is essential for enabling effective priority control strategies. This paper presents the design, deployment, and evaluation of an infrastructure-based multi-modal freight vehicle detection system integrating LiDAR and camera sensors. A hybrid sensing architecture is adopted, consisting of an intersection-mounted subsystem and a midblock subsystem, connected via wireless communication for synchronized data transmission. The perception pipeline incorporates both clustering-based and deep learning-based detection methods with Kalman filter tracking to achieve stable real-time performance. LiDAR measurements are registered into geodetic reference frames to support lane-level localization and consistent vehicle tracking. Field evaluations demonstrate that the system can reliably monitor freight vehicle movements at high spatio-temporal resolution. The design and deployment provide practical insights for developing infrastructure-based sensing systems to support FSP applications.

1.4LGFeb 21

HONEST-CAV: Hierarchical Optimization of Network Signals and Trajectories for Connected and Automated Vehicles with Multi-Agent Reinforcement Learning

Ziyan Zhang, Changxin Wan, Peng Hao et al.

This study presents a hierarchical, network-level traffic flow control framework for mixed traffic consisting of Human-driven Vehicles (HVs), Connected and Automated Vehicles (CAVs). The framework jointly optimizes vehicle-level eco-driving behaviors and intersection-level traffic signal control to enhance overall network efficiency and decrease energy consumption. A decentralized Multi-Agent Reinforcement Learning (MARL) approach by Value Decomposition Network (VDN) manages cycle-based traffic signal control (TSC) at intersections, while an innovative Signal Phase and Timing (SPaT) prediction method integrates a Machine Learning-based Trajectory Planning Algorithm (MLTPA) to guide CAVs in executing Eco-Approach and Departure (EAD) maneuvers. The framework is evaluated across varying CAV proportions and powertrain types to assess its effects on mobility and energy performance. Experimental results conducted in a 4*4 real-world network demonstrate that the MARL-based TSC method outperforms the baseline model (i.e., Webster method) in speed, fuel consumption, and idling time. In addition, with MLTPA, HONEST-CAV benefits the traffic system further in energy consumption and idling time. With a 60% CAV proportion, vehicle average speed, fuel consumption, and idling time can be improved/saved by 7.67%, 10.23%, and 45.83% compared with the baseline. Furthermore, discussions on CAV proportions and powertrain types are conducted to quantify the performance of the proposed method with the impact of automation and electrification.

1.2SYJul 17, 2025

Dual LiDAR-Based Traffic Movement Count Estimation at a Signalized Intersection: Deployment, Data Collection, and Preliminary Analysis

Saswat Priyadarshi Nayak, Guoyuan Wu, Kanok Boriboonsomsin et al.

Traffic Movement Count (TMC) at intersections is crucial for optimizing signal timings, assessing the performance of existing traffic control measures, and proposing efficient lane configurations to minimize delays, reduce congestion, and promote safety. Traditionally, methods such as manual counting, loop detectors, pneumatic road tubes, and camera-based recognition have been used for TMC estimation. Although generally reliable, camera-based TMC estimation is prone to inaccuracies under poor lighting conditions during harsh weather and nighttime. In contrast, Light Detection and Ranging (LiDAR) technology is gaining popularity in recent times due to reduced costs and its expanding use in 3D object detection, tracking, and related applications. This paper presents the authors' endeavor to develop, deploy and evaluate a dual-LiDAR system at an intersection in the city of Rialto, California, for TMC estimation. The 3D bounding box detections from the two LiDARs are used to classify vehicle counts based on traffic directions, vehicle movements, and vehicle classes. This work discusses the estimated TMC results and provides insights into the observed trends and irregularities. Potential improvements are also discussed that could enhance not only TMC estimation, but also trajectory forecasting and intent prediction at intersections.