CVLGNov 8, 2018

Adaptive Semantic Segmentation with a Strategic Curriculum of Proxy Labels

arXiv:1811.03542v118 citations
Originality Incremental advance
AI Analysis

This addresses the costly annotation and domain shift issues in semantic segmentation for computer vision applications, representing an incremental improvement over existing domain adaptation methods.

The paper tackles the problem of poor generalization in semantic segmentation across domains by training a network with a mixture of labeled source data and proxy-labeled target data, achieving state-of-the-art results in unsupervised domain adaptation on datasets like GTA5, Cityscapes, and BDD100k.

Training deep networks for semantic segmentation requires annotation of large amounts of data, which can be time-consuming and expensive. Unfortunately, these trained networks still generalize poorly when tested in domains not consistent with the training data. In this paper, we show that by carefully presenting a mixture of labeled source domain and proxy-labeled target domain data to a network, we can achieve state-of-the-art unsupervised domain adaptation results. With our design, the network progressively learns features specific to the target domain using annotation from only the source domain. We generate proxy labels for the target domain using the network's own predictions. Our architecture then allows selective mining of easy samples from this set of proxy labels, and hard samples from the annotated source domain. We conduct a series of experiments with the GTA5, Cityscapes and BDD100k datasets on synthetic-to-real domain adaptation and geographic domain adaptation, showing the advantages of our method over baselines and existing approaches.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes