CV · Mar 25

Lookalike3D: Seeing Double in 3D

arXiv:2603.24713 · 83.1 · h-index: 5
Predicted impact: top 18% in CV · last 90 days
Originality: Incremental advance
AI Analysis

The paper addresses the challenge of leveraging repeated objects for consistent 3D perception in indoor scenes, an incremental advance in domain-specific 3D vision.

The paper tackles the problem of detecting identical and similar objects in 3D indoor scenes, introducing a new task and dataset, and shows a 104% IoU improvement over baselines.

3D object understanding and generation methods produce impressive results, yet they often overlook a pervasive source of information in real-world scenes: repeated objects. We introduce the task of lookalike object detection in indoor scenes, which leverages repeated and complementary cues from identical and near-identical object pairs. Given an input scene, the task is to classify pairs of objects as identical, similar or different using multiview images as input. To address this, we present Lookalike3D, a multiview image transformer that effectively distinguishes such object pairs by harnessing strong semantic priors from large image foundation models. To support this task, we collected the 3DTwins dataset, containing 76k manually annotated identical, similar and different pairs of objects based on ScanNet++, and show an improvement of 104% IoU over baselines. We demonstrate how our method improves downstream tasks such as enabling joint 3D object reconstruction and part co-segmentation, turning repeated and lookalike objects into a powerful cue for consistent, high-quality 3D perception. Our code, dataset and models will be made publicly available.
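The abstract reports the result as a percentage improvement in IoU but does not say how the IoU is computed. Since IoU is bounded by 1, a 104% gain can only be a relative one, meaning the score roughly doubles. The sketch below is one plausible reading, not the authors' evaluation code: it enumerates object pairs in a scene, scores three-way pair labels with a per-class IoU of TP / (TP + FP + FN), and computes a relative improvement. The object names, labels, helper functions, and the 0.25 and 0.51 scores are all hypothetical.

```python
from itertools import combinations

LABELS = ("identical", "similar", "different")


def per_class_iou(true_labels, pred_labels, cls):
    """IoU for one class over pair labels: TP / (TP + FP + FN)."""
    paired = list(zip(true_labels, pred_labels))
    tp = sum(t == cls and p == cls for t, p in paired)
    fp = sum(t != cls and p == cls for t, p in paired)
    fn = sum(t == cls and p != cls for t, p in paired)
    denom = tp + fp + fn
    return tp / denom if denom else 0.0


def relative_improvement(baseline, new):
    """Relative gain in percent; 104.0 means the new score is 2.04x the baseline."""
    return (new - baseline) / baseline * 100.0


# Hypothetical scene with four objects; every unordered pair gets one label.
objects = ["chair_a", "chair_b", "chair_c", "table"]
pairs = list(combinations(objects, 2))  # 6 unordered pairs

# Hypothetical ground truth and predictions, in the same order as `pairs`.
true = ["identical", "similar", "different", "similar", "different", "different"]
pred = ["identical", "identical", "different", "similar", "different", "similar"]

ious = {cls: per_class_iou(true, pred, cls) for cls in LABELS}
mean_iou = sum(ious.values()) / len(ious)

# Hypothetical baseline and method scores, chosen only to illustrate the arithmetic.
gain = relative_improvement(0.25, 0.51)
```

The number of pairs grows quadratically with the number of objects in a scene, so the "different" class would usually dominate; a per-class IoU avoids letting that class swamp the score, which is one reason this reading seems plausible.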

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.
