CVJul 6, 2017

Zero-Shot Deep Domain Adaptation

arXiv:1707.01922v597 citations
Originality Incremental advance
AI Analysis

This addresses a key limitation in domain adaptation for machine learning practitioners by enabling adaptation when target-domain data is unavailable, though it appears incremental as it builds on existing domain adaptation concepts.

The paper tackles the problem of domain adaptation without access to task-relevant target-domain data by proposing zero-shot deep domain adaptation (ZDDA), which uses task-irrelevant dual-domain pairs to learn representations applicable to both source and target domains, achieving domain adaptation in classification tasks on datasets like MNIST and SUN RGB-D.

Domain adaptation is an important tool to transfer knowledge about a task (e.g. classification) learned in a source domain to a second, or target domain. Current approaches assume that task-relevant target-domain data is available during training. We demonstrate how to perform domain adaptation when no such task-relevant target-domain data is available. To tackle this issue, we propose zero-shot deep domain adaptation (ZDDA), which uses privileged information from task-irrelevant dual-domain pairs. ZDDA learns a source-domain representation which is not only tailored for the task of interest but also close to the target-domain representation. Therefore, the source-domain task of interest solution (e.g. a classifier for classification tasks) which is jointly trained with the source-domain representation can be applicable to both the source and target representations. Using the MNIST, Fashion-MNIST, NIST, EMNIST, and SUN RGB-D datasets, we show that ZDDA can perform domain adaptation in classification tasks without access to task-relevant target-domain training data. We also extend ZDDA to perform sensor fusion in the SUN RGB-D scene classification task by simulating task-relevant target-domain representations with task-relevant source-domain data. To the best of our knowledge, ZDDA is the first domain adaptation and sensor fusion method which requires no task-relevant target-domain data. The underlying principle is not particular to computer vision data, but should be extensible to other domains.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes