CVSep 15, 2024

GLCONet: Learning Multi-source Perception Representation for Camouflaged Object Detection

arXiv:2409.09588v131 citationsh-index: 11Has Code
Originality Incremental advance
AI Analysis

This addresses the problem of detecting camouflaged objects in images, which is important for applications like surveillance and biology, but the approach is incremental as it builds on existing biological perception methods.

The paper tackles camouflaged object detection by proposing GLCONet, a network that models local details and global long-range dependencies to improve accuracy, outperforming 20 state-of-the-art methods on three public datasets.

Recently, biological perception has been a powerful tool for handling the camouflaged object detection (COD) task. However, most existing methods are heavily dependent on the local spatial information of diverse scales from convolutional operations to optimize initial features. A commonly neglected point in these methods is the long-range dependencies between feature pixels from different scale spaces that can help the model build a global structure of the object, inducing a more precise image representation. In this paper, we propose a novel Global-Local Collaborative Optimization Network, called GLCONet. Technically, we first design a collaborative optimization strategy from the perspective of multi-source perception to simultaneously model the local details and global long-range relationships, which can provide features with abundant discriminative information to boost the accuracy in detecting camouflaged objects. Furthermore, we introduce an adjacent reverse decoder that contains cross-layer aggregation and reverse optimization to integrate complementary information from different levels for generating high-quality representations. Extensive experiments demonstrate that the proposed GLCONet method with different backbones can effectively activate potentially significant pixels in an image, outperforming twenty state-of-the-art methods on three public COD datasets. The source code is available at: \https://github.com/CSYSI/GLCONet.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes