CVIVMar 28, 2025

Data Quality Matters: Quantifying Image Quality Impact on Machine Learning Performance

arXiv:2503.22375v15 citationsh-index: 92025 IEEE Intelligent Vehicles Symposium (IV)
Originality Synthesis-oriented
AI Analysis

This addresses data quality issues for automated driving systems, but it is incremental as it applies existing metrics to a new domain.

The paper tackles the problem of how image modifications from compression and virtualization degrade machine learning performance in automated driving, finding that the LPIPS metric best correlates image quality with model accuracy.

Precise perception of the environment is essential in highly automated driving systems, which rely on machine learning tasks such as object detection and segmentation. Compression of sensor data is commonly used for data handling, while virtualization is used for hardware-in-the-loop validation. Both methods can alter sensor data and degrade model performance. This necessitates a systematic approach to quantifying image validity. This paper presents a four-step framework to evaluate the impact of image modifications on machine learning tasks. First, a dataset with modified images is prepared to ensure one-to-one matching image pairs, enabling measurement of deviations resulting from compression and virtualization. Second, image deviations are quantified by comparing the effects of compression and virtualization against original camera-based sensor data. Third, the performance of state-of-the-art object detection models is analyzed to determine how altered input data affects perception tasks, including bounding box accuracy and reliability. Finally, a correlation analysis is performed to identify relationships between image quality and model performance. As a result, the LPIPS metric achieves the highest correlation between image deviation and machine learning performance across all evaluated machine learning tasks.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes