CVApr 25, 2022

Multi-Head Distillation for Continual Unsupervised Domain Adaptation in Semantic Segmentation

arXiv:2204.11667v125 citationsh-index: 60
Originality Incremental advance
AI Analysis

This work addresses a practical issue for real-world perception systems that need to handle varying conditions like lighting and locations, though it is incremental in building on existing continual learning and domain adaptation techniques.

The paper tackles the problem of catastrophic forgetting in continual unsupervised domain adaptation for semantic segmentation, where models must adapt to multiple target domains sequentially without revisiting previous ones, and proposes MuHDi, a multi-head distillation method that achieves competitive performance on challenging benchmarks.

Unsupervised Domain Adaptation (UDA) is a transfer learning task which aims at training on an unlabeled target domain by leveraging a labeled source domain. Beyond the traditional scope of UDA with a single source domain and a single target domain, real-world perception systems face a variety of scenarios to handle, from varying lighting conditions to many cities around the world. In this context, UDAs with several domains increase the challenges with the addition of distribution shifts within the different target domains. This work focuses on a novel framework for learning UDA, continuous UDA, in which models operate on multiple target domains discovered sequentially, without access to previous target domains. We propose MuHDi, for Multi-Head Distillation, a method that solves the catastrophic forgetting problem, inherent in continual learning tasks. MuHDi performs distillation at multiple levels from the previous model as well as an auxiliary target-specialist segmentation head. We report both extensive ablation and experiments on challenging multi-target UDA semantic segmentation benchmarks to validate the proposed learning scheme and architecture.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes