CVSep 30, 2025

Multi-View Camera System for Variant-Aware Autonomous Vehicle Inspection and Defect Detection

arXiv:2509.26454v1h-index: 7
Originality Incremental advance
AI Analysis

This addresses quality control for automotive manufacturers by providing a real-time, deployable system, though it is incremental as it combines existing methods in a novel application.

The paper tackles the challenge of verifying vehicle variant specifications and detecting visible defects on production lines by presenting an Automated Vehicle Inspection platform that uses multi-view cameras and deep learning, achieving 93% verification accuracy, 86% defect-detection recall, and a throughput of 3.3 vehicles per minute.

Ensuring that every vehicle leaving a modern production line is built to the correct \emph{variant} specification and is free from visible defects is an increasingly complex challenge. We present the \textbf{Automated Vehicle Inspection (AVI)} platform, an end-to-end, \emph{multi-view} perception system that couples deep-learning detectors with a semantic rule engine to deliver \emph{variant-aware} quality control in real time. Eleven synchronized cameras capture a full 360° sweep of each vehicle; task-specific views are then routed to specialised modules: YOLOv8 for part detection, EfficientNet for ICE/EV classification, Gemini-1.5 Flash for mascot OCR, and YOLOv8-Seg for scratch-and-dent segmentation. A view-aware fusion layer standardises evidence, while a VIN-conditioned rule engine compares detected features against the expected manifest, producing an interpretable pass/fail report in \(\approx\! 300\,\text{ms}\). On a mixed data set of Original Equipment Manufacturer(OEM) vehicle data sets of four distinct models plus public scratch/dent images, AVI achieves \textbf{ 93 \%} verification accuracy, \textbf{86 \%} defect-detection recall, and sustains \(\mathbf{3.3}\) vehicles/min, surpassing single-view or no segmentation baselines by large margins. To our knowledge, this is the first publicly reported system that unifies multi-camera feature validation with defect detection in a deployable automotive setting in industry.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes