RO LGMay 29

HOIST: Humanoid Optimization with Imitation and Sample-efficient Tuning for Manipulating Suspended Loads

Songyang Liu, Shunyu Yao, Dingyuan Huang, Shuai Li

arXiv:2606.0025271.2

AI Analysis

For humanoid robotics, this work addresses the challenge of underactuated load manipulation by providing a sample-efficient tuning method that improves upon imitation learning.

HOIST combines imitation learning from VR teleoperation with iterative batched reinforcement learning to improve a humanoid robot's manipulation of suspended loads, reducing translational placement error by 19.9 cm and angular error by 3.56 degrees compared to imitation-only baselines.

Manipulating suspended payloads with humanoid robots is challenging because the robot can only influence an underactuated, oscillatory load through whole-body motion and intermittent contact. Imitation learning provides safe initial behavior but does not directly optimize final placement, while reinforcement learning from scratch is unsafe and sample-inefficient on real humanoids. We present HOIST-Humanoid Optimized with Imitation and Sample-efficient Tuning for manipulating suspended loads. HOIST first finetunes a high-level vision-language-action (VLA) policy from virtual-reality (VR) teleoperation demonstrations and executes its commands through a whole-body controller. It then uses VLA rollouts and iterative batched RL to improve placement accuracy and stopping behavior. Experiments in simulation and on a real humanoid show that HOIST improves over imitation-only and additional-demonstration baselines; compared with pure VLA rollouts, HOIST reduces translational placement error by 19.9 cm and raw angular error by 3.56 degrees, demonstrating the potential of humanoids for underactuated material-handling tasks.

View on arXiv PDF

Similar