CVAIMay 8, 2025

Enhancing Satellite Object Localization with Dilated Convolutions and Attention-aided Spatial Pooling

arXiv:2505.05599v32 citationsh-index: 5Has Code2025 International Conference on Advanced Machine Learning and Data Science (AMLDS)
Originality Incremental advance
AI Analysis

This addresses the problem of accurate object detection in satellite imagery for atmospheric and oceanographic research, representing an incremental improvement over existing methods.

The paper tackles object localization in satellite imagery by introducing YOLO-DCAP, an enhanced version of YOLOv5 with multi-scale dilated convolutions and attention-aided spatial pooling, achieving average improvements of 20.95% in mAP50 and 32.23% in IoU over the base model.

Object localization in satellite imagery is particularly challenging due to the high variability of objects, low spatial resolution, and interference from noise and dominant features such as clouds and city lights. In this research, we focus on three satellite datasets: upper atmospheric Gravity Waves (GW), mesospheric Bores (Bore), and Ocean Eddies (OE), each presenting its own unique challenges. These challenges include the variability in the scale and appearance of the main object patterns, where the size, shape, and feature extent of objects of interest can differ significantly. To address these challenges, we introduce YOLO-DCAP, a novel enhanced version of YOLOv5 designed to improve object localization in these complex scenarios. YOLO-DCAP incorporates a Multi-scale Dilated Residual Convolution (MDRC) block to capture multi-scale features at scale with varying dilation rates, and an Attention-aided Spatial Pooling (AaSP) module to focus on the global relevant spatial regions, enhancing feature selection. These structural improvements help to better localize objects in satellite imagery. Experimental results demonstrate that YOLO-DCAP significantly outperforms both the YOLO base model and state-of-the-art approaches, achieving an average improvement of 20.95% in mAP50 and 32.23% in IoU over the base model, and 7.35% and 9.84% respectively over state-of-the-art alternatives, consistently across all three satellite datasets. These consistent gains across all three satellite datasets highlight the robustness and generalizability of the proposed approach. Our code is open sourced at https://github.com/AI-4-atmosphere-remote-sensing/satellite-object-localization.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes