CVROMar 12, 2025

Video Individual Counting for Moving Drones

arXiv:2503.10701v26 citationsh-index: 13
Originality Incremental advance
AI Analysis

This addresses the challenge of accurate individual counting in highly dynamic and crowded surveillance scenarios, though it is incremental as it builds on existing density map approaches.

The paper tackles the problem of Video Individual Counting (VIC) in crowded scenes with fast-moving drones by introducing a new dataset and a method that estimates shared density maps to count unique pedestrians, achieving state-of-the-art results in dynamic conditions.

Video Individual Counting (VIC) has received increasing attention for its importance in intelligent video surveillance. Existing works are limited in two aspects, i.e., dataset and method. Previous datasets are captured with fixed or rarely moving cameras with relatively sparse individuals, restricting evaluation for a highly varying view and time in crowded scenes. Existing methods rely on localization followed by association or classification, which struggle under dense and dynamic conditions due to inaccurate localization of small targets. To address these issues, we introduce the MovingDroneCrowd Dataset, featuring videos captured by fast-moving drones in crowded scenes under diverse illuminations, shooting heights and angles. We further propose a Shared Density map-guided Network (SDNet) using a Depth-wise Cross-Frame Attention (DCFA) module to directly estimate shared density maps between consecutive frames, from which the inflow and outflow density maps are derived by subtracting the shared density maps from the global density maps. The inflow density maps across frames are summed up to obtain the number of unique pedestrians in a video. Experiments on our datasets and publicly available ones show the superiority of our method over the state of the arts in highly dynamic and complex crowded scenes. Our dataset and codes have been released publicly.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes