SYLGDec 18, 2023

Active search and coverage using point-cloud reinforcement learning

arXiv:2312.11410v11 citationsh-index: 26ICSTCC
Originality Synthesis-oriented
AI Analysis

This addresses a domain-specific problem in robotics and computer vision for efficient scene exploration, but it is incremental as it builds on existing deep RL and point-cloud processing methods.

The paper tackles the problem of optimizing a mobile 3D sensor's trajectory to quickly find and cover target objects, using an end-to-end deep reinforcement learning solution that achieves significantly better and more robust results than a greedy baseline.

We consider a problem in which the trajectory of a mobile 3D sensor must be optimized so that certain objects are both found in the overall scene and covered by the point cloud, as fast as possible. This problem is called target search and coverage, and the paper provides an end-to-end deep reinforcement learning (RL) solution to solve it. The deep neural network combines four components: deep hierarchical feature learning occurs in the first stage, followed by multi-head transformers in the second, max-pooling and merging with bypassed information to preserve spatial relationships in the third, and a distributional dueling network in the last stage. To evaluate the method, a simulator is developed where cylinders must be found by a Kinect sensor. A network architecture study shows that deep hierarchical feature learning works for RL and that by using farthest point sampling (FPS) we can reduce the amount of points and achieve not only a reduction of the network size but also better results. We also show that multi-head attention for point-clouds helps to learn the agent faster but converges to the same outcome. Finally, we compare RL using the best network with a greedy baseline that maximizes immediate rewards and requires for that purpose an oracle that predicts the next observation. We decided RL achieves significantly better and more robust results than the greedy strategy.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes