CVAug 30, 2025

C-DiffDet+: Fusing Global Scene Context with Generative Denoising for High-Fidelity Car Damage Detection

Abdellah Zakaria Sellam, Ilyes Benaissa, Salah Eddine Bekhouche, Abdenour Hadid, Vito Renó, Cosimo Distante

arXiv:2509.00578v43.6h-index: 35

Originality Incremental advance

AI Analysis

This work addresses the challenge of reliable vehicle damage assessment for automotive and insurance industries, representing an incremental improvement in context-aware object detection.

The paper tackled the problem of fine-grained car damage detection by introducing Context-Aware Fusion to integrate global scene context with local features, resulting in improved performance over state-of-the-art models on the CarDD benchmark.

Fine-grained object detection in challenging visual domains, such as vehicle damage assessment, presents a formidable challenge even for human experts to resolve reliably. While DiffusionDet has advanced the state-of-the-art through conditional denoising diffusion, its performance remains limited by local feature conditioning in context-dependent scenarios. We address this fundamental limitation by introducing Context-Aware Fusion (CAF), which leverages cross-attention mechanisms to integrate global scene context with local proposal features directly. The global context is generated using a separate dedicated encoder that captures comprehensive environmental information, enabling each object proposal to attend to scene-level understanding. Our framework significantly enhances the generative detection paradigm by enabling each object proposal to attend to comprehensive environmental information. Experimental results demonstrate an improvement over state-of-the-art models on the CarDD benchmark, establishing new performance benchmarks for context-aware object detection in fine-grained domains

View on arXiv PDF

Similar