CVAIMay 30, 2025

Deformable Attention Mechanisms Applied to Object Detection, case of Remote Sensing

arXiv:2505.24489v13 citationsh-index: 5
Originality Synthesis-oriented
AI Analysis

This work addresses object detection in remote sensing, an important domain for geographical analysis, but it is incremental as it applies an existing method to new data types.

The authors applied the Deformable-DETR model with deformable attention mechanisms to object detection in remote sensing images, achieving F1 scores of 95.12% on an optical dataset and 94.54% on a SAR dataset.

Object detection has recently seen an interesting trend in terms of the most innovative research work, this task being of particular importance in the field of remote sensing, given the consistency of these images in terms of geographical coverage and the objects present. Furthermore, Deep Learning (DL) models, in particular those based on Transformers, are especially relevant for visual computing tasks in general, and target detection in particular. Thus, the present work proposes an application of Deformable-DETR model, a specific architecture using deformable attention mechanisms, on remote sensing images in two different modes, especially optical and Synthetic Aperture Radar (SAR). To achieve this objective, two datasets are used, one optical, which is Pleiades Aircraft dataset, and the other SAR, in particular SAR Ship Detection Dataset (SSDD). The results of a 10-fold stratified validation showed that the proposed model performed particularly well, obtaining an F1 score of 95.12% for the optical dataset and 94.54% for SSDD, while comparing these results with several models detections, especially those based on CNNs and transformers, as well as those specifically designed to detect different object classes in remote sensing images.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes