CV ROMay 26

OSMa-Bench++: Toward Open-Ended Benchmarking of Semantic Mapping for Manipulation with Prompt-Generated Synthetic Scenes

Regina Kurkova, Maxim Popov, Sergey Kolyubin

arXiv:2605.2683152.0Has Code

Predicted impact top 67% in CV · last 90 daysOriginality Synthesis-oriented

AI Analysis

For researchers evaluating semantic mapping methods for robotic manipulation, this provides a more extensible and controllable benchmarking framework that better covers manipulation-relevant corner cases.

This work extends OSMa-Bench to enable controllable benchmarking of semantic mapping for manipulation using prompt-generated synthetic scenes, supporting targeted stress-testing under conditions like clutter, occlusion, and lighting variation.

Semantic mapping methods are increasingly used as intermediate scene representations for downstream robotic reasoning and manipulation, yet their evaluation is still largely tied to fixed benchmark datasets with limited coverage of manipulation-relevant corner cases. In this work, we extend OSMa-Bench toward controllable benchmarking with prompt-generated synthetic indoor scenes. Our pipeline automatically generates scene descriptions, synthesizes corresponding environments with SceneSmith, and adapts the resulting assets into an OSMa-Bench-compatible simulation format. This adaptation requires a nontrivial intermediate layer, including semantic normalization, material and texture repair, shader fallback policies, floor handling, navigation setup, and controlled lighting configuration. A key advantage of the proposed setup is that the original scene-generation prompt is known in advance and can therefore serve as an auxiliary semantic specification of the intended scene. We use this property to extend the VQA component of OSMa-Bench with a prompt-grounded question category. The resulting framework supports targeted stress-testing of semantic scene representations under conditions such as clutter, small objects, partial occlusions, and lighting variation, and makes benchmarking more extensible and better aligned with downstream manipulation requirements. Our code is available at https://github.com/be2rlab/OSMa-Bench-v2.

View on arXiv PDF Code

Similar