CV | Dec 12, 2025

Assisted Refinement Network Based on Channel Information Interaction for Camouflaged and Salient Object Detection

Kuan Wang, Yanjun Qin, Mengge Lu, Liejun Wang, Xiaoming Tao

arXiv:2512.11369v1 | 2 citations | h-index: 6 | Has Code

Originality: Incremental advance

AI Analysis

This work addresses the accurate detection and segmentation of camouflaged objects in computer vision. The contribution is incremental: it builds on existing methods by improving feature fusion and boundary handling.

The paper tackles Camouflaged Object Detection (COD) by targeting two weaknesses of current decoders: insufficient cross-channel information interaction and ineffective co-modeling of boundary and region information. It reports state-of-the-art performance on four benchmark datasets and shows adaptability to downstream tasks such as Salient Object Detection and polyp segmentation.

Camouflaged Object Detection (COD) stands as a significant challenge in computer vision, dedicated to identifying and segmenting objects visually highly integrated with their backgrounds. Current mainstream methods have made progress in cross-layer feature fusion, but two critical issues persist during the decoding stage. The first is insufficient cross-channel information interaction within the same-layer features, limiting feature expressiveness. The second is the inability to effectively co-model boundary and region information, making it difficult to accurately reconstruct complete regions and sharp boundaries of objects. To address the first issue, we propose the Channel Information Interaction Module (CIIM), which introduces a horizontal-vertical integration mechanism in the channel dimension. This module performs feature reorganization and interaction across channels to effectively capture complementary cross-channel information. To address the second issue, we construct a collaborative decoding architecture guided by prior knowledge. This architecture generates boundary priors and object localization maps through Boundary Extraction (BE) and Region Extraction (RE) modules, then employs hybrid attention to collaboratively calibrate decoded features, effectively overcoming semantic ambiguity and imprecise boundaries. Additionally, the Multi-scale Enhancement (MSE) module enriches contextual feature representations. Extensive experiments on four COD benchmark datasets validate the effectiveness and state-of-the-art performance of the proposed model. We further transferred our model to the Salient Object Detection (SOD) task and demonstrated its adaptability across downstream tasks, including polyp segmentation, transparent object detection, and industrial and road defect detection. Code and experimental results are publicly available at: https://github.com/akuan1234/ARNet-v2.
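
To make the idea of cross-channel interaction within a single feature layer more concrete, here is a minimal NumPy sketch. It is not the authors' CIIM; the abstract does not specify how the horizontal-vertical integration is computed. The sketch assumes one plausible reading: the C channels are arranged on a hypothetical rows x cols grid, each channel is summarized by global average pooling, and every channel is then gated using its own descriptor plus the mean of its grid row ("horizontal") and grid column ("vertical"). The function name `channel_grid_interaction`, the grid layout, and the parameter-free sigmoid gate are all assumptions made for illustration; a real module would use learned weights.

```python
import numpy as np


def sigmoid(x):
    """Element-wise logistic function."""
    return 1.0 / (1.0 + np.exp(-x))


def channel_grid_interaction(feat, rows, cols):
    """Gate each channel using context from its row and column on a channel grid.

    feat: array of shape (C, H, W) with C == rows * cols.
    Illustrative stand-in only, NOT the paper's CIIM: there are no learned
    parameters, and the grid arrangement of channels is an assumption.
    """
    c, h, w = feat.shape
    if c != rows * cols:
        raise ValueError("number of channels must equal rows * cols")

    # Global average pooling: one scalar descriptor per channel.
    desc = feat.mean(axis=(1, 2))            # shape (C,)
    grid = desc.reshape(rows, cols)          # channels laid out on a 2D grid

    # "Horizontal" context: mean descriptor of each grid row.
    row_ctx = grid.mean(axis=1, keepdims=True)   # shape (rows, 1)
    # "Vertical" context: mean descriptor of each grid column.
    col_ctx = grid.mean(axis=0, keepdims=True)   # shape (1, cols)

    # Each channel's gate mixes its own descriptor with both contexts.
    gate = sigmoid(grid + row_ctx + col_ctx).reshape(c, 1, 1)

    # Re-weight the original feature maps channel by channel.
    return feat * gate


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    x = rng.standard_normal((8, 16, 16))     # 8 channels on a 2 x 4 grid
    y = channel_grid_interaction(x, rows=2, cols=4)
    print(y.shape)
```

Because the gate lies strictly between 0 and 1, the output keeps the input's shape and never exceeds it in magnitude; channels whose grid row and column carry strong responses are attenuated less. Swapping the fixed means for learned projections would bring the sketch closer to a trainable attention module, but the exact design in the paper should be taken from the released code.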

