CVAIOct 31, 2024

Localization, balance and affinity: a stronger multifaceted collaborative salient object detector in remote sensing images

arXiv:2410.23991v124 citationsh-index: 19IEEE Trans Geosci Remote Sens
Originality Incremental advance
AI Analysis

This work improves salient object detection for remote sensing applications, but it is incremental as it builds on existing deep learning methods with specific enhancements.

The paper tackled the challenge of accurately detecting salient objects in optical remote sensing images by addressing edge structure and contextual modeling issues, resulting in a proposed method (LBA-MCNet) that outperformed 28 state-of-the-art approaches on three datasets.

Despite significant advancements in salient object detection(SOD) in optical remote sensing images(ORSI), challenges persist due to the intricate edge structures of ORSIs and the complexity of their contextual relationships. Current deep learning approaches encounter difficulties in accurately identifying boundary features and lack efficiency in collaboratively modeling the foreground and background by leveraging contextual features. To address these challenges, we propose a stronger multifaceted collaborative salient object detector in ORSIs, termed LBA-MCNet, which incorporates aspects of localization, balance, and affinity. The network focuses on accurately locating targets, balancing detailed features, and modeling image-level global context information. Specifically, we design the Edge Feature Adaptive Balancing and Adjusting(EFABA) module for precise edge localization, using edge features to guide attention to boundaries and preserve spatial details. Moreover, we design the Global Distributed Affinity Learning(GDAL) module to model global context. It captures global context by generating an affinity map from the encoders final layer, ensuring effective modeling of global patterns. Additionally, deep supervision during deconvolution further enhances feature representation. Finally, we compared with 28 state of the art approaches on three publicly available datasets. The results clearly demonstrate the superiority of our method.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes