CVAILGJun 15, 2025

MGDFIS: Multi-scale Global-detail Feature Integration Strategy for Small Object Detection

arXiv:2506.12697v24 citationsh-index: 3
Originality Highly original
AI Analysis

This addresses the challenge of detecting small objects in cluttered UAV scenes for applications like search-and-rescue, offering a practical solution for resource-constrained platforms.

The paper tackles the problem of small object detection in UAV imagery by proposing the Multi-scale Global-detail Feature Integration Strategy (MGDFIS), which integrates global context with local detail to improve detection performance while maintaining efficiency, achieving superior precision and recall on the VisDrone benchmark with low inference time.

Small object detection in UAV imagery is crucial for applications such as search-and-rescue, traffic monitoring, and environmental surveillance, but it is hampered by tiny object size, low signal-to-noise ratios, and limited feature extraction. Existing multi-scale fusion methods help, but add computational burden and blur fine details, making small object detection in cluttered scenes difficult. To overcome these challenges, we propose the Multi-scale Global-detail Feature Integration Strategy (MGDFIS), a unified fusion framework that tightly couples global context with local detail to boost detection performance while maintaining efficiency. MGDFIS comprises three synergistic modules: the FusionLock-TSS Attention Module, which marries token-statistics self-attention with DynamicTanh normalization to highlight spectral and spatial cues at minimal cost; the Global-detail Integration Module, which fuses multi-scale context via directional convolution and parallel attention while preserving subtle shape and texture variations; and the Dynamic Pixel Attention Module, which generates pixel-wise weighting maps to rebalance uneven foreground and background distributions and sharpen responses to true object regions. Extensive experiments on the VisDrone benchmark demonstrate that MGDFIS consistently outperforms state-of-the-art methods across diverse backbone architectures and detection frameworks, achieving superior precision and recall with low inference time. By striking an optimal balance between accuracy and resource usage, MGDFIS provides a practical solution for small-object detection on resource-constrained UAV platforms.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes