EgoLive: A Large-Scale Egocentric Dataset from Real-World Human Tasks
For the robot learning community, EgoLive provides a scalable, high-quality dataset to accelerate research in generalizable robotic models, though its impact is incremental as it follows the established paradigm of egocentric video collection.
EgoLive is a large-scale egocentric dataset for robot manipulation learning, featuring 3,000 hours of multi-modal data from real-world tasks across home service, retail, and other scenarios, with high-precision annotations. It aims to address the scarcity of scalable, high-quality datasets for robot learning.
The advancement of robot learning is currently hindered by the scarcity of large-scale, high-quality datasets. While established data collection methods such as teleoperation and universal manipulation interfaces dominate current datasets, they suffer from inherent limitations in scalability and real-world deployability. Human egocentric video collection, by contrast, has emerged as a promising approach to enable scalable, natural and in-the-wild data collection. As such, we present EgoLive, a large-scale, high-quality egocentric dataset designed explicitly for robot manipulation learning. EgoLive establishes three distinctive technical advantages over existing egocentric datasets: first, it represents the largest open-source annotated egocentric dataset focused on real-world task-oriented human routines to date; second, it delivers leading data quality via a customized head-mounted capture device and comprehensive high-precision multi-modal annotations; third, all data is collected exclusively in unconstrained real-world scenarios and encompasses vertical field human working data, including home service, retail, and other practical work scenarios, providing superior diversity and ecological validity. With the introduction of EgoLive, we aim to provide the research community with a scalable, high-quality dataset that accelerates breakthroughs in generalizable robotic models and facilitates the real-world deployment of robot systems.