CV · LG · Jan 8, 2025

Combining YOLO and Visual Rhythm for Vehicle Counting

arXiv:2501.04534v1 · 7 citations · h-index: 3 · Anais Estendidos da XXXVI Conference on Graphics, Patterns and Images (SIBGRAPI Estendido 2023)
Originality: Incremental advance
AI Analysis

This incremental advance improves efficiency in transport infrastructure management by reducing the processing time of vehicle counting.

The paper tackles the computational cost of video-based vehicle counting by eliminating the tracking step and running detection only on key frames, achieving a mean counting accuracy of about 99.15% and roughly three times the processing speed of tracking-based methods.

Video-based vehicle detection and counting play a critical role in managing transport infrastructure. Traditional image-based counting methods usually involve two main steps, initial detection and subsequent tracking, both applied to every video frame, which significantly increases computational complexity. To address this issue, this work presents an alternative, more efficient method for vehicle detection and counting. The proposed approach eliminates the tracking step and detects vehicles only in key video frames, thereby increasing efficiency. To achieve this, we developed a system that combines YOLO, for vehicle detection, with Visual Rhythm, a technique for building spatio-temporal images that lets us focus on the frames that contain useful information. Additionally, the method can be used for counting in any application in which unidirectional moving targets must be detected and identified. Experimental analysis using real videos shows that the proposed method achieves a mean counting accuracy of around 99.15% over a set of videos, with a processing speed three times faster than tracking-based approaches.
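The Visual Rhythm idea in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation. It assumes grayscale frames stored as nested lists, a horizontal counting line at a hypothetical row `line_y`, and a plain intensity threshold standing in for the YOLO detector. The function names (`visual_rhythm`, `find_marks`, `key_frames`) are illustrative only.

```python
from typing import List, Tuple

Frame = List[List[int]]  # grayscale frame: rows of pixel intensities


def visual_rhythm(frames: List[Frame], line_y: int) -> List[List[int]]:
    """Stack the pixels under a horizontal counting line, one row per frame.

    Rows of the result index time (frame number); columns index position
    along the counting line.
    """
    return [list(frame[line_y]) for frame in frames]


def find_marks(vr: List[List[int]], background: int = 0,
               tol: int = 10) -> List[Tuple[int, int]]:
    """Return (first_frame, last_frame) for each run of consecutive rows
    that differ from the background intensity.

    Assumption: this threshold is only a stand-in for a learned detector.
    It merges vehicles that cross the line at the same time.
    """
    marks: List[Tuple[int, int]] = []
    start = None
    for t, row in enumerate(vr):
        active = any(abs(p - background) > tol for p in row)
        if active and start is None:
            start = t                      # a mark begins at this frame
        elif not active and start is not None:
            marks.append((start, t - 1))   # the mark ended on the previous frame
            start = None
    if start is not None:                  # mark still open at end of video
        marks.append((start, len(vr) - 1))
    return marks


def key_frames(marks: List[Tuple[int, int]]) -> List[int]:
    """Pick the middle frame of each mark as the frame worth inspecting."""
    return [(first + last) // 2 for first, last in marks]


def synthetic_video(n_frames: int = 12, height: int = 4,
                    width: int = 6) -> List[Frame]:
    """Toy video: two bright objects cross row 2 at different times."""
    frames = [[[0] * width for _ in range(height)] for _ in range(n_frames)]
    for t in (2, 3, 4):                    # first object, frames 2 to 4
        for x in (1, 2):
            frames[t][2][x] = 200
    for t in (8, 9):                       # second object, frames 8 to 9
        for x in (3, 4):
            frames[t][2][x] = 180
    return frames


if __name__ == "__main__":
    vr = visual_rhythm(synthetic_video(), line_y=2)
    marks = find_marks(vr)
    print("count:", len(marks), "key frames:", key_frames(marks))
```

In this toy setting only two of the twelve frames would need to be passed to a full detector, which is the kind of saving the abstract attributes to dropping the tracking step.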

Code Implementations: 1 repo
