CVMay 23, 2025

Semantic segmentation with reward

arXiv:2505.17905v11 citationsh-index: 3
Originality Incremental advance
AI Analysis

This work addresses the challenge of training semantic segmentation models with limited labeling, which is incremental as it applies reinforcement learning to a known bottleneck in weakly supervised segmentation.

The paper tackles the problem of semantic segmentation when pixel-level labels are unavailable by proposing RSS, a reward-based reinforcement learning method that uses image-level feedback to train segmentation networks, achieving convergence and outperforming existing weakly supervised methods.

In real-world scenarios, pixel-level labeling is not always available. Sometimes, we need a semantic segmentation network, and even a visual encoder can have a high compatibility, and can be trained using various types of feedback beyond traditional labels, such as feedback that indicates the quality of the parsing results. To tackle this issue, we proposed RSS (Reward in Semantic Segmentation), the first practical application of reward-based reinforcement learning on pure semantic segmentation offered in two granular levels (pixel-level and image-level). RSS incorporates various novel technologies, such as progressive scale rewards (PSR) and pair-wise spatial difference (PSD), to ensure that the reward facilitates the convergence of the semantic segmentation network, especially under image-level rewards. Experiments and visualizations on benchmark datasets demonstrate that the proposed RSS can successfully ensure the convergence of the semantic segmentation network on two levels of rewards. Additionally, the RSS, which utilizes an image-level reward, outperforms existing weakly supervised methods that also rely solely on image-level signals during training.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes