CVSep 25, 2023

Weakly Supervised Semantic Segmentation by Knowledge Graph Inference

arXiv:2309.14057v23.97 citationsh-index: 7Has Code

Originality Incremental advance

AI Analysis

This work addresses the challenge of improving segmentation accuracy in weakly supervised settings, which is crucial for applications like image analysis where full annotations are costly, though it appears incremental by building on existing CNN-based methods.

The paper tackles the problem of weakly supervised semantic segmentation by enhancing both multi-label classification and segmentation network stages using graph reasoning, achieving state-of-the-art performance on PASCAL VOC 2012 and MS-COCO datasets with image-level supervision.

Currently, existing efforts in Weakly Supervised Semantic Segmentation (WSSS) based on Convolutional Neural Networks (CNNs) have predominantly focused on enhancing the multi-label classification network stage, with limited attention given to the equally important downstream segmentation network. Furthermore, CNN-based local convolutions lack the ability to model the extensive inter-category dependencies. Therefore, this paper introduces a graph reasoning-based approach to enhance WSSS. The aim is to improve WSSS holistically by simultaneously enhancing both the multi-label classification and segmentation network stages. In the multi-label classification network segment, external knowledge is integrated, coupled with GCNs, to globally reason about inter-class dependencies. This encourages the network to uncover features in non-salient regions of images, thereby refining the completeness of generated pseudo-labels. In the segmentation network segment, the proposed Graph Reasoning Mapping (GRM) module is employed to leverage knowledge obtained from textual databases, facilitating contextual reasoning for class representation within image regions. This GRM module enhances feature representation in high-level semantics of the segmentation network's local convolutions, while dynamically learning semantic coherence for individual samples. Using solely image-level supervision, we have achieved state-of-the-art performance in WSSS on the PASCAL VOC 2012 and MS-COCO datasets. Extensive experimentation on both the multi-label classification and segmentation network stages underscores the effectiveness of the proposed graph reasoning approach for advancing WSSS.

View on arXiv PDF Code

Similar