CVMay 19, 2025

Self-Supervised Learning for Image Segmentation: A Comprehensive Survey

arXiv:2505.13584v14 citationsh-index: 15
Originality Synthesis-oriented
AI Analysis

It provides a comprehensive overview for researchers in computer vision, but it is incremental as it synthesizes existing work without introducing new methods.

This survey tackles the challenge of summarizing self-supervised learning methods for image segmentation by reviewing over 150 recent articles, categorizing pretext tasks, downstream tasks, and benchmark datasets to guide researchers.

Supervised learning demands large amounts of precisely annotated data to achieve promising results. Such data curation is labor-intensive and imposes significant overhead regarding time and costs. Self-supervised learning (SSL) partially overcomes these limitations by exploiting vast amounts of unlabeled data and creating surrogate (pretext or proxy) tasks to learn useful representations without manual labeling. As a result, SSL has become a powerful machine learning (ML) paradigm for solving several practical downstream computer vision problems, such as classification, detection, and segmentation. Image segmentation is the cornerstone of many high-level visual perception applications, including medical imaging, intelligent transportation, agriculture, and surveillance. Although there is substantial research potential for developing advanced algorithms for SSL-based semantic segmentation, a comprehensive study of existing methodologies is essential to trace advances and guide emerging researchers. This survey thoroughly investigates over 150 recent image segmentation articles, particularly focusing on SSL. It provides a practical categorization of pretext tasks, downstream tasks, and commonly used benchmark datasets for image segmentation research. It concludes with key observations distilled from a large body of literature and offers future directions to make this research field more accessible and comprehensible for readers.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes