ROHCMar 8

Uncertainty Mitigation and Intent Inference: A Dual-Mode Human-Machine Joint Planning System

arXiv:2603.07822v11 citations
Predicted impact top 15% in RO · last 90 daysOriginality Highly original
AI Analysis

This work is significant for improving human-robot collaboration in open-world environments by enabling robots to actively resolve uncertainties and infer human intent, making them more effective teammates.

This paper introduces a human-robot joint planning system that addresses task-relevant knowledge gaps and latent human intent. The system reduces interaction cost by 51.9% in uncertainty-mitigation planning and decreases task execution time by 25.4% in intent-aware cooperation compared to baselines.

Effective human-robot collaboration in open-world environments requires joint planning under uncertain conditions. However, existing approaches often treat humans as passive supervisors, preventing autonomous agents from becoming human-like teammates that can actively model teammate behaviors, reason about knowledge gaps, query, and elicit responses through communication to resolve uncertainties. To address these limitations, we propose a unified human-robot joint planning system designed to tackle dual sources of uncertainty: task-relevant knowledge gaps and latent human intent. Our system operates in two complementary modes. First, an uncertainty-mitigation joint planning module enables two-way conversations to resolve semantic ambiguity and object uncertainty. It utilizes an LLM-assisted active elicitation mechanism and a hypothesis-augmented A^* search, subsequently computing an optimal querying policy via dynamic programming to minimize interaction and verification costs. Second, a real-time intent-aware collaboration module maintains a probabilistic belief over the human's latent task intent via spatial and directional cues, enabling dynamic, coordination-aware task selection for agents without explicit communication. We validate the proposed system in both Gazebo simulations and real-world UAV deployments integrated with a Vision-Language Model (VLM)-based 3D semantic perception pipeline. Experimental results demonstrate that the system significantly cuts the interaction cost by 51.9% in uncertainty-mitigation planning and reduces the task execution time by 25.4% in intent-aware cooperation compared to the baselines.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes