AI · May 11

EmbodiSkill: Skill-Aware Reflection for Self-Evolving Embodied Agents

arXiv:2605.10332 · 21.1
Predicted impact: top 29% in AI · last 90 days
Originality: Incremental advance
AI Analysis

For embodied AI agents, this method enables autonomous skill improvement without any model training, addressing the challenge of evolving skills from the agent's own trajectories in embodied environments.

EmbodiSkill introduces a training-free framework in which embodied agents self-evolve their skills by distinguishing skill errors from execution lapses in their trajectories. With a frozen Qwen3.5-27B executor it achieves 93.28% task success on ALFWorld, outperforming GPT-5.2 used as a direct agent without skills by 31.58%.

Embodied agents can benefit from skills that guide object search, action execution, and state changes across diverse environments. Since embodied environments vary across layouts, object states, and other execution factors, these skills must self-evolve from trajectories generated during task execution. However, existing skill self-evolution methods are mainly developed in digital environments and often convert trajectories into coarse skill updates. Directly applying this paradigm to embodied settings is problematic, because a failed task execution may reflect not only incorrect skill content, but also an execution lapse in which the agent fails to follow valid guidance. We propose EmbodiSkill, a training-free framework for embodied skill self-evolution through skill-aware reflection and targeted revision. EmbodiSkill interprets each trajectory with respect to the current skill, uses skill-changing evidence to update the skill body, and uses execution-lapse evidence to preserve and emphasize valid guidance. Experiments on ALFWorld and EmbodiedBench show that EmbodiSkill consistently improves embodied task success. On ALFWorld, EmbodiSkill enables a frozen Qwen3.5-27B executor to reach 93.28% task success, outperforming GPT-5.2 used as a direct agent without skills by 31.58%. These results show that skill-aware self-evolution helps embodied agents accumulate reusable procedural knowledge from their own trajectories.
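The distinction the abstract draws between skill-changing evidence and execution-lapse evidence can be illustrated with a minimal sketch. This is not the authors' implementation: the names `Evidence`, `Skill`, `Reflection`, and `revise` are hypothetical, and the reflection step that labels a trajectory (presumably done by a language model in the paper) is assumed to have already produced the label. The sketch shows only the targeted-revision idea: skill-changing evidence rewrites the skill body, while execution-lapse evidence leaves the body intact and marks the existing guidance for emphasis.

```python
from dataclasses import dataclass, field
from enum import Enum


class Evidence(Enum):
    # Hypothetical labels for the two evidence types named in the abstract.
    SKILL_CHANGING = "skill_changing"    # the skill content was wrong or incomplete
    EXECUTION_LAPSE = "execution_lapse"  # the guidance was valid but was not followed


@dataclass
class Skill:
    body: list[str] = field(default_factory=list)      # ordered guidance steps
    emphasized: set[int] = field(default_factory=set)  # indices of steps to stress


@dataclass
class Reflection:
    # Assumed output of a reflection step over one trajectory.
    kind: Evidence
    step_index: int          # which skill step the evidence concerns
    revised_text: str = ""   # only used for SKILL_CHANGING evidence


def revise(skill: Skill, reflection: Reflection) -> Skill:
    """Return a new skill with a targeted revision; the input skill is not mutated."""
    body = list(skill.body)
    emphasized = set(skill.emphasized)
    if reflection.kind is Evidence.SKILL_CHANGING:
        # The skill content itself was at fault: replace the offending step.
        body[reflection.step_index] = reflection.revised_text
    else:
        # The guidance was valid: keep the text unchanged and flag it for emphasis.
        emphasized.add(reflection.step_index)
    return Skill(body=body, emphasized=emphasized)


if __name__ == "__main__":
    skill = Skill(body=["look for the mug on the countertop",
                        "heat the mug with the microwave"])
    lapse = Reflection(Evidence.EXECUTION_LAPSE, step_index=1)
    print(revise(skill, lapse))
```

Under this design, a failed episode caused by the agent ignoring a valid step never erases that step, which is the failure mode the abstract attributes to coarse skill updates.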

