RO AIApr 9

Learning Without Losing Identity: Capability Evolution for Embodied Agents

Xue Qin, Simin Luan, John See, Cong Yang, Zhijun Li

arXiv:2604.0779967.85 citationsh-index: 2

AI Analysis

This addresses the challenge of continuous improvement without identity loss for embodied agents in dynamic environments, representing a novel approach rather than an incremental one.

The paper tackles the problem of instability and identity loss in long-lived embodied agents by proposing a capability-centric evolution paradigm, where modular, versioned units of functionality evolve separately from the agent's identity, resulting in task success rates improving from 32.4% to 91.3% over 20 iterations while preserving zero policy drift and safety violations.

Embodied agents are expected to operate persistently in dynamic physical environments, continuously acquiring new capabilities over time. Existing approaches to improving agent performance often rely on modifying the agent itself -- through prompt engineering, policy updates, or structural redesign -- leading to instability and loss of identity in long-lived systems. In this work, we propose a capability-centric evolution paradigm for embodied agents. We argue that a robot should maintain a persistent agent as its cognitive identity, while enabling continuous improvement through the evolution of its capabilities. Specifically, we introduce the concept of Embodied Capability Modules (ECMs), which represent modular, versioned units of embodied functionality that can be learned, refined, and composed over time. We present a unified framework in which capability evolution is decoupled from agent identity. Capabilities evolve through a closed-loop process involving task execution, experience collection, model refinement, and module updating, while all executions are governed by a runtime layer that enforces safety and policy constraints. We demonstrate through simulated embodied tasks that capability evolution improves task success rates from 32.4% to 91.3% over 20 iterations, outperforming both agent-modification baselines and established skill-learning methods (SPiRL, SkiMo), while preserving zero policy drift and zero safety violations. Our results suggest that separating agent identity from capability evolution provides a scalable and safe foundation for long-term embodied intelligence.

View on arXiv PDF

Similar