CVApr 28, 2022

Region-level Contrastive and Consistency Learning for Semi-Supervised Semantic Segmentation

arXiv:2204.13314v118 citationsh-index: 24
Originality Incremental advance
AI Analysis

This work improves semi-supervised semantic segmentation for computer vision applications, representing an incremental advancement over existing pixel-level approaches.

The paper tackles the problem of semi-supervised semantic segmentation by proposing a region-level contrastive and consistency learning framework (RC^2L) to address issues with pixel-level methods, such as noise sensitivity and high computational costs. The method achieves state-of-the-art results on PASCAL VOC 2012 and Cityscapes benchmarks.

Current semi-supervised semantic segmentation methods mainly focus on designing pixel-level consistency and contrastive regularization. However, pixel-level regularization is sensitive to noise from pixels with incorrect predictions, and pixel-level contrastive regularization has memory and computational cost with O(pixel_num^2). To address the issues, we propose a novel region-level contrastive and consistency learning framework (RC^2L) for semi-supervised semantic segmentation. Specifically, we first propose a Region Mask Contrastive (RMC) loss and a Region Feature Contrastive (RFC) loss to accomplish region-level contrastive property. Furthermore, Region Class Consistency (RCC) loss and Semantic Mask Consistency (SMC) loss are proposed for achieving region-level consistency. Based on the proposed region-level contrastive and consistency regularization, we develop a region-level contrastive and consistency learning framework (RC^2L) for semi-supervised semantic segmentation, and evaluate our RC$^2$L on two challenging benchmarks (PASCAL VOC 2012 and Cityscapes), outperforming the state-of-the-art.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes