CVJun 9

From Simulation to Real-World: An In-Field 6D Pose Dataset and Baseline for Robotic Strawberry Harvesting

arXiv:2606.11381v16.4h-index: 28
Predicted impact top 78% in CV · last 90 daysOriginality Synthesis-oriented
AI Analysis

This work provides a benchmark dataset and baseline for 6D pose estimation in agricultural robotics, addressing the lack of real-world evaluation data.

The authors present the first real-world 6D pose ground truth dataset for strawberries (12,040 images) and a synthetic dataset with scene-level realism, revealing a significant sim-to-real gap that underscores the need for real field data in robotic harvesting.

Robotic strawberry harvesting requires precise 6D pose estimation; however, collecting 6D pose ground truth in real agricultural fields is inherently challenging. Existing 6D pose estimation methods have therefore relied solely on synthetic data that lacks scene-level realism, leaving their performance under real agricultural field conditions unquantified. In this work, we present, to the best of our knowledge, the first real-world 6D pose ground truth dataset of strawberries collected in actual agricultural fields (12,040 images). We also introduce a synthetic dataset rendered in NVIDIA Isaac Sim, featuring scene-level realism and domain randomization. Nevertheless, our experiments reveal that a significant sim-to-real gap persists, underscoring the necessity of real agricultural field data for reliable evaluation. We further quantify the sim-to-real gap through baseline 6D pose estimation results across backbone encoders, serving as a reference for future work. The real-world dataset will be made available upon acceptance.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes