CVOct 17, 2024

RemoteDet-Mamba: A Hybrid Mamba-CNN Network for Multi-modal Object Detection in Remote Sensing Images

arXiv:2410.13532v114 citationsh-index: 11
Originality Incremental advance
AI Analysis

This work addresses object detection challenges in remote sensing for applications like emergency response, but it appears incremental as it builds on existing Mamba and CNN methods.

The paper tackled the problem of detecting small, densely distributed objects in UAV remote sensing images by proposing RemoteDet-Mamba, a hybrid Mamba-CNN network, which achieved superior detection accuracy on the DroneVehicle dataset while maintaining computational efficiency.

Unmanned aerial vehicle (UAV) remote sensing is widely applied in fields such as emergency response, owing to its advantages of rapid information acquisition and low cost. However, due to the effects of shooting distance and imaging mechanisms, the objects in the images present challenges such as small size, dense distribution, and low inter-class differentiation. To this end, we propose a multimodal remote sensing detection network that employs a quad-directional selective scanning fusion strategy called RemoteDet-Mamba. RemoteDet-Mamba simultaneously facilitates the learning of single-modal local features and the integration of patch-level global features across modalities, enhancing the distinguishability for small objects and utilizing local information to improve discrimination between different classes. Additionally, the use of Mamba's serial processing significantly increases detection speed. Experimental results on the DroneVehicle dataset demonstrate the effectiveness of RemoteDet-Mamba, which achieves superior detection accuracy compared to state-of-the-art methods while maintaining computational efficiency and parameter count.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes