CVDec 23, 2025

CoDi -- an exemplar-conditioned diffusion model for low-shot counting

arXiv:2512.20153v11 citationsh-index: 23Has Code
Originality Highly original
AI Analysis

This addresses the challenge of counting previously unobserved objects in images with few or no exemplars, particularly in dense scenarios, representing a strong specific gain rather than an incremental improvement.

The paper tackles the problem of low-shot object counting in dense regions with small objects by proposing CoDi, a latent diffusion-based model that produces high-quality density maps for accurate object localization. The method outperforms state-of-the-art approaches by 15% MAE on FSC benchmark and 44% MAE on MCAC benchmark.

Low-shot object counting addresses estimating the number of previously unobserved objects in an image using only few or no annotated test-time exemplars. A considerable challenge for modern low-shot counters are dense regions with small objects. While total counts in such situations are typically well addressed by density-based counters, their usefulness is limited by poor localization capabilities. This is better addressed by point-detection-based counters, which are based on query-based detectors. However, due to limited number of pre-trained queries, they underperform on images with very large numbers of objects, and resort to ad-hoc techniques like upsampling and tiling. We propose CoDi, the first latent diffusion-based low-shot counter that produces high-quality density maps on which object locations can be determined by non-maxima suppression. Our core contribution is the new exemplar-based conditioning module that extracts and adjusts the object prototypes to the intermediate layers of the denoising network, leading to accurate object location estimation. On FSC benchmark, CoDi outperforms state-of-the-art by 15% MAE, 13% MAE and 10% MAE in the few-shot, one-shot, and reference-less scenarios, respectively, and sets a new state-of-the-art on MCAC benchmark by outperforming the top method by 44% MAE. The code is available at https://github.com/gsustar/CoDi.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes