CVFeb 27, 2024

Weakly Supervised Co-training with Swapping Assignments for Semantic Segmentation

arXiv:2402.17891v20.1310 citationsh-index: 7Has CodeECCV
AI Analysis85

This work addresses the reliance on offline refinement processes in weakly supervised semantic segmentation, offering a more efficient and generalizable solution for researchers and practitioners in computer vision.

The paper tackles the problem of inconsistent and erroneous class activation maps in weakly supervised semantic segmentation by proposing an end-to-end model called CoSA, which achieves mIoU of 76.2% on VOC and 51.0% on COCO, outperforming existing baselines and being the first single-stage method to surpass multi-stage ones.

Class activation maps (CAMs) are commonly employed in weakly supervised semantic segmentation (WSSS) to produce pseudo-labels. Due to incomplete or excessive class activation, existing studies often resort to offline CAM refinement, introducing additional stages or proposing offline modules. This can cause optimization difficulties for single-stage methods and limit generalizability. In this study, we aim to reduce the observed CAM inconsistency and error to mitigate reliance on refinement processes. We propose an end-to-end WSSS model incorporating guided CAMs, wherein our segmentation model is trained while concurrently optimizing CAMs online. Our method, Co-training with Swapping Assignments (CoSA), leverages a dual-stream framework, where one sub-network learns from the swapped assignments generated by the other. We introduce three techniques: i) soft perplexity-based regularization to penalize uncertain regions; ii) a threshold-searching approach to dynamically revise the confidence threshold; and iii) contrastive separation to address the coexistence problem. CoSA demonstrates exceptional performance, achieving mIoU of 76.2\% and 51.0\% on VOC and COCO validation datasets, respectively, surpassing existing baselines by a substantial margin. Notably, CoSA is the first single-stage approach to outperform all existing multi-stage methods including those with additional supervision. Code is avilable at \url{https://github.com/youshyee/CoSA}.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes