CVROFeb 24, 2018

Adaptive Deep Learning through Visual Domain Localization

arXiv:1802.08833v123 citations
Originality Incremental advance
AI Analysis

This addresses the challenge of visual domain adaptation in robotics, where access to all target classes is unrealistic, offering a more practical solution for deployment in varied environments.

The paper tackles the problem of domain adaptation for robot vision when only a subset of target domain classes is available during training, proposing a method that spatially grounds domain shift to reduce parameters and improve flexibility, achieving suitability for robot vision as confirmed on the iCub World database.

A commercial robot, trained by its manufacturer to recognize a predefined number and type of objects, might be used in many settings, that will in general differ in their illumination conditions, background, type and degree of clutter, and so on. Recent computer vision works tackle this generalization issue through domain adaptation methods, assuming as source the visual domain where the system is trained and as target the domain of deployment. All approaches assume to have access to images from all classes of the target during training, an unrealistic condition in robotics applications. We address this issue proposing an algorithm that takes into account the specific needs of robot vision. Our intuition is that the nature of the domain shift experienced mostly in robotics is local. We exploit this through the learning of maps that spatially ground the domain and quantify the degree of shift, embedded into an end-to-end deep domain adaptation architecture. By explicitly localizing the roots of the domain shift we significantly reduce the number of parameters of the architecture to tune, we gain the flexibility necessary to deal with subset of categories in the target domain at training time, and we provide a clear feedback on the rationale behind any classification decision, which can be exploited in human-robot interactions. Experiments on two different settings of the iCub World database confirm the suitability of our method for robot vision.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes