CVOct 10, 2018

Learning Deep Representations for Semantic Image Parsing: a Comprehensive Overview

arXiv:1810.04377v11 citations
Originality Synthesis-oriented
AI Analysis

This is an incremental survey paper for researchers in computer vision.

This paper provides a comprehensive overview of semantic image parsing, summarizing progress in category-level segmentation, instance-level segmentation, and beyond segmentation, while discussing frameworks, datasets, and future challenges.

Semantic image parsing, which refers to the process of decomposing images into semantic regions and constructing the structure representation of the input, has recently aroused widespread interest in the field of computer vision. The recent application of deep representation learning has driven this field into a new stage of development. In this paper, we summarize three aspects of the progress of research on semantic image parsing, i.e., category-level semantic segmentation, instance-level semantic segmentation, and beyond segmentation. Specifically, we first review the general frameworks for each task and introduce the relevant variants. The advantages and limitations of each method are also discussed. Moreover, we present a comprehensive comparison of different benchmark datasets and evaluation metrics. Finally, we explore the future trends and challenges of semantic image parsing.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes