CVROMar 31, 2025

Video-based Traffic Light Recognition by Rockchip RV1126 for Autonomous Driving

arXiv:2503.23965v12 citationsh-index: 22025 IEEE Intelligent Vehicles Symposium (IV)
Originality Incremental advance
AI Analysis

This addresses safety and navigation challenges in urban autonomous driving by improving robustness in complex scenarios, though it is incremental over single-frame methods.

The paper tackles robust traffic light recognition for autonomous driving by proposing ViTLR, a video-based neural network that processes multiple frames, achieving state-of-the-art performance with real-time processing (>25 FPS) on an embedded platform.

Real-time traffic light recognition is fundamental for autonomous driving safety and navigation in urban environments. While existing approaches rely on single-frame analysis from onboard cameras, they struggle with complex scenarios involving occlusions and adverse lighting conditions. We present \textit{ViTLR}, a novel video-based end-to-end neural network that processes multiple consecutive frames to achieve robust traffic light detection and state classification. The architecture leverages a transformer-like design with convolutional self-attention modules, which is optimized specifically for deployment on the Rockchip RV1126 embedded platform. Extensive evaluations on two real-world datasets demonstrate that \textit{ViTLR} achieves state-of-the-art performance while maintaining real-time processing capabilities (>25 FPS) on RV1126's NPU. The system shows superior robustness across temporal stability, varying target distances, and challenging environmental conditions compared to existing single-frame approaches. We have successfully integrated \textit{ViTLR} into an ego-lane traffic light recognition system using HD maps for autonomous driving applications. The complete implementation, including source code and datasets, is made publicly available to facilitate further research in this domain.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes