ROCVMay 6, 2025

OpenHelix: A Short Survey, Empirical Analysis, and Open-Source Dual-System VLA Model for Robotic Manipulation

arXiv:2505.03912v175 citationsh-index: 18Has Code
Originality Synthesis-oriented
AI Analysis

This provides an incremental resource for researchers in embodied intelligence to analyze and optimize dual-system VLA architectures.

The paper addresses the lack of open-source dual-system VLA models for robotic manipulation by summarizing existing architectures and conducting empirical evaluations, resulting in a low-cost open-source model for further exploration.

Dual-system VLA (Vision-Language-Action) architectures have become a hot topic in embodied intelligence research, but there is a lack of sufficient open-source work for further performance analysis and optimization. To address this problem, this paper will summarize and compare the structural designs of existing dual-system architectures, and conduct systematic empirical evaluations on the core design elements of existing dual-system architectures. Ultimately, it will provide a low-cost open-source model for further exploration. Of course, this project will continue to update with more experimental conclusions and open-source models with improved performance for everyone to choose from. Project page: https://openhelix-robot.github.io/.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes