ROCVOct 9, 2025

R2RGEN: Real-to-Real 3D Data Generation for Spatially Generalized Manipulation

arXiv:2510.08547v14 citationsh-index: 14
Originality Incremental advance
AI Analysis

This work addresses the need for efficient data collection in robotics to improve spatial generalization, offering a plug-and-play solution that reduces sim-to-real gaps, though it appears incremental in advancing data generation methods.

The paper tackles the problem of spatial generalization in robotic manipulation by proposing R2RGen, a real-to-real 3D data generation framework that augments pointcloud observation-action pairs from minimal source demonstrations, resulting in enhanced data efficiency and strong potential for mobile manipulation applications.

Towards the aim of generalized robotic manipulation, spatial generalization is the most fundamental capability that requires the policy to work robustly under different spatial distribution of objects, environment and agent itself. To achieve this, substantial human demonstrations need to be collected to cover different spatial configurations for training a generalized visuomotor policy via imitation learning. Prior works explore a promising direction that leverages data generation to acquire abundant spatially diverse data from minimal source demonstrations. However, most approaches face significant sim-to-real gap and are often limited to constrained settings, such as fixed-base scenarios and predefined camera viewpoints. In this paper, we propose a real-to-real 3D data generation framework (R2RGen) that directly augments the pointcloud observation-action pairs to generate real-world data. R2RGen is simulator- and rendering-free, thus being efficient and plug-and-play. Specifically, given a single source demonstration, we introduce an annotation mechanism for fine-grained parsing of scene and trajectory. A group-wise augmentation strategy is proposed to handle complex multi-object compositions and diverse task constraints. We further present camera-aware processing to align the distribution of generated data with real-world 3D sensor. Empirically, R2RGen substantially enhances data efficiency on extensive experiments and demonstrates strong potential for scaling and application on mobile manipulation.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes