ROCVJan 23, 2023

Learning to View: Decision Transformers for Active Object Detection

CMU
arXiv:2301.09544v124 citationsh-index: 20
Originality Incremental advance
AI Analysis

This work addresses the challenge of improving object detection quality for robots by integrating planning and perception, though it is incremental as it builds on existing RL and transformer methods.

The paper tackles the problem of passive object detection in robotics by using reinforcement learning to actively control robot movement for better image collection, resulting in a method that outperforms expert policies and offline RL baselines on an indoor simulator dataset.

Active perception describes a broad class of techniques that couple planning and perception systems to move the robot in a way to give the robot more information about the environment. In most robotic systems, perception is typically independent of motion planning. For example, traditional object detection is passive: it operates only on the images it receives. However, we have a chance to improve the results if we allow planning to consume detection signals and move the robot to collect views that maximize the quality of the results. In this paper, we use reinforcement learning (RL) methods to control the robot in order to obtain images that maximize the detection quality. Specifically, we propose using a Decision Transformer with online fine-tuning, which first optimizes the policy with a pre-collected expert dataset and then improves the learned policy by exploring better solutions in the environment. We evaluate the performance of proposed method on an interactive dataset collected from an indoor scenario simulator. Experimental results demonstrate that our method outperforms all baselines, including expert policy and pure offline RL methods. We also provide exhaustive analyses of the reward distribution and observation space.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes