CVMar 20

UniPR: Unified Object-level Real-to-Sim Perception and Reconstruction from a Single Stereo Pair

arXiv:2603.1961655.8h-index: 5
AI Analysis

This addresses inefficiency and cumulative error in real-to-sim transfer for robotics, though it is incremental as it builds on existing perception and reconstruction tasks.

The paper tackles the problem of inefficient and error-prone modular pipelines for object perception and reconstruction from images by proposing UniPR, an end-to-end framework that processes a single stereo pair to reconstruct all objects in a scene in parallel, achieving significant efficiency gains and preserving physical proportions.

Perceiving and reconstructing objects from images are critical for real-to-sim transfer tasks, which are widely used in the robotics community. Existing methods rely on multiple submodules such as detection, segmentation, shape reconstruction, and pose estimation to complete the pipeline. However, such modular pipelines suffer from inefficiency and cumulative error, as each stage operates on only partial or locally refined information while discarding global context. To address these limitations, we propose UniPR, the first end-to-end object-level real-to-sim perception and reconstruction framework. Operating directly on a single stereo image pair, UniPR leverages geometric constraints to resolve the scale ambiguity. We introduce Pose-Aware Shape Representation to eliminate the need for per-category canonical definitions and to bridge the gap between reconstruction and pose estimation tasks. Furthermore, we construct a large-vocabulary stereo dataset, LVS6D, comprising over 6,300 objects, to facilitate large-scale research in this area. Extensive experiments demonstrate that UniPR reconstructs all objects in a scene in parallel within a single forward pass, achieving significant efficiency gains and preserves true physical proportions across diverse object types, highlighting its potential for practical robotic applications.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes