ROApr 16

POMDP-based Object Search with Growing State Space and Hybrid Action Domain

Yongbo Chen, Hesheng Wang, Shoudong Huang, Hanna Kurniawati

arXiv:2604.1496525.3h-index: 6

Predicted impact top 70% in RO · last 90 daysOriginality Incremental advance

AI Analysis

For mobile robotics, this work provides a novel online POMDP solver that improves object search efficiency in realistic indoor settings with limited perception and computational constraints.

The paper tackles the problem of efficient object search in complex 3D indoor environments for mobile robots, proposing a POMDP-based method (GNPF-kCT) that handles growing state spaces and hybrid action domains. In Gazebo simulations with Fetch and Stretch robots, it achieves faster and more reliable target localization than POMDP-based baselines and SOTA non-POMDP methods including LLM-based approaches.

Efficiently locating target objects in complex indoor environments with diverse furniture, such as shelves, tables, and beds, is a significant challenge for mobile robots. This difficulty arises from factors like localization errors, limited fields of view, and visual occlusion. We address this by framing the object-search task as a highdimensional Partially Observable Markov Decision Process (POMDP) with a growing state space and hybrid (continuous and discrete) action spaces in 3D environments. Based on a meticulously designed perception module, a novel online POMDP solver named the growing neural process filtered k-center clustering tree (GNPF-kCT) is proposed to tackle this problem. Optimal actions are selected using Monte Carlo Tree Search (MCTS) with belief tree reuse for growing state space, a neural process network to filter useless primitive actions, and k-center clustering hypersphere discretization for efficient refinement of high-dimensional action spaces. A modified upper-confidence bound (UCB), informed by belief differences and action value functions within cells of estimated diameters, guides MCTS expansion. Theoretical analysis validates the convergence and performance potential of our method. To address scenarios with limited information or rewards, we also introduce a guessed target object with a grid-world model as a key strategy to enhance search efficiency. Extensive Gazebo simulations with Fetch and Stretch robots demonstrate faster and more reliable target localization than POMDP-based baselines and state-of-the-art (SOTA) non-POMDP-based solvers, especially large language model (LLM) based methods, in object search under the same computational constraints and perception systems. Real-world tests in office environments confirm the practical applicability of our approach. Project page: https://sites.google.com/view/gnpfkct.

View on arXiv PDF

Similar