ROAICVMar 17, 2025

Humanoid Policy ~ Human Policy

arXiv:2503.13441v375 citationsh-index: 17
Originality Incremental advance
AI Analysis

This addresses the scalability issue in robot learning by reducing reliance on expensive tele-operated data, though it is incremental in leveraging human data for cross-embodiment training.

The paper tackles the problem of training manipulation policies for humanoid robots by using egocentric human demonstrations as cross-embodiment training data, resulting in improved generalization and robustness with significantly better data collection efficiency.

Training manipulation policies for humanoid robots with diverse data enhances their robustness and generalization across tasks and platforms. However, learning solely from robot demonstrations is labor-intensive, requiring expensive tele-operated data collection which is difficult to scale. This paper investigates a more scalable data source, egocentric human demonstrations, to serve as cross-embodiment training data for robot learning. We mitigate the embodiment gap between humanoids and humans from both the data and modeling perspectives. We collect an egocentric task-oriented dataset (PH2D) that is directly aligned with humanoid manipulation demonstrations. We then train a human-humanoid behavior policy, which we term Human Action Transformer (HAT). The state-action space of HAT is unified for both humans and humanoid robots and can be differentiably retargeted to robot actions. Co-trained with smaller-scale robot data, HAT directly models humanoid robots and humans as different embodiments without additional supervision. We show that human data improves both generalization and robustness of HAT with significantly better data collection efficiency. Code and data: https://human-as-robot.github.io/

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes