ROCVSep 21, 2018

GAPLE: Generalizable Approaching Policy LEarning for Robotic Object Searching in Indoor Environment

arXiv:1809.08287v212 citations
AI Analysis

This work addresses the limited generalization capability in visual navigation for robotic object searching, which is an incremental improvement over prior scene-driven or recognition-driven methods.

The paper tackles the problem of learning a generalizable action policy for robotic object searching in indoor environments using visual inputs, achieving validation through empirical studies on the House3D dataset and a physical platform.

We study the problem of learning a generalizable action policy for an intelligent agent to actively approach an object of interest in an indoor environment solely from its visual inputs. While scene-driven or recognition-driven visual navigation has been widely studied, prior efforts suffer severely from the limited generalization capability. In this paper, we first argue the object searching task is environment dependent while the approaching ability is general. To learn a generalizable approaching policy, we present a novel solution dubbed as GAPLE which adopts two channels of visual features: depth and semantic segmentation, as the inputs to the policy learning module. The empirical studies conducted on the House3D dataset as well as on a physical platform in a real world scenario validate our hypothesis, and we further provide in-depth qualitative analysis.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes