CVApr 7

Toward Unified Fine-Grained Vehicle Classification and Automatic License Plate Recognition

arXiv:2604.0527137.02 citationsh-index: 17Has Code
AI Analysis

This work addresses the need for integrated fine-grained vehicle classification and license plate recognition in intelligent transportation systems, but it is incremental as it primarily introduces a new dataset with some benchmarking.

The authors tackled the problem of extracting vehicle information from surveillance images by introducing UFPR-VeSV, a dataset with 24,945 images and annotations for 13 colors, 26 makes, 136 models, and 14 types, validated using license plate data, and benchmarked with deep learning models to reveal challenges like handling multicolored vehicles and infrared images.

Extracting vehicle information from surveillance images is essential for intelligent transportation systems, enabling applications such as traffic monitoring and criminal investigations. While Automatic License Plate Recognition (ALPR) is widely used, Fine-Grained Vehicle Classification (FGVC) offers a complementary approach by identifying vehicles based on attributes such as color, make, model, and type. Although there have been advances in this field, existing studies often assume well-controlled conditions, explore limited attributes, and overlook FGVC integration with ALPR. To address these gaps, we introduce UFPR-VeSV, a dataset comprising 24,945 images of 16,297 unique vehicles with annotations for 13 colors, 26 makes, 136 models, and 14 types. Collected from the Military Police of Paraná (Brazil) surveillance system, the dataset captures diverse real-world conditions, including partial occlusions, nighttime infrared imaging, and varying lighting. All FGVC annotations were validated using license plate information, with text and corner annotations also being provided. A qualitative and quantitative comparison with established datasets confirmed the challenging nature of our dataset. A benchmark using five deep learning models further validated this, revealing specific challenges such as handling multicolored vehicles, infrared images, and distinguishing between vehicle models that share a common platform. Additionally, we apply two optical character recognition models to license plate recognition and explore the joint use of FGVC and ALPR. The results highlight the potential of integrating these complementary tasks for real-world applications. The UFPR-VeSV dataset is publicly available at: https://github.com/Lima001/UFPR-VeSV-Dataset.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes