CVAug 30, 2024

BOP-Distrib: Revisiting 6D Pose Estimation Benchmarks for Better Evaluation under Visual Ambiguities

arXiv:2408.17297v42 citationsh-index: 4
Originality Incremental advance
AI Analysis

This work addresses a critical evaluation gap in computer vision for researchers and practitioners, though it is incremental as it builds on existing datasets and methods.

The paper tackles the problem of evaluating 6D pose estimation methods under visual ambiguities by proposing an automatic method to re-annotate datasets with image-specific pose distributions, which significantly alters the ranking of state-of-the-art methods and establishes the first benchmark for pose distribution methods on real images.

6D pose estimation aims at determining the object pose that best explains the camera observation. The unique solution for non-ambiguous objects can turn into a multi-modal pose distribution for symmetrical objects or when occlusions of symmetry-breaking elements happen, depending on the viewpoint. Currently, 6D pose estimation methods are benchmarked on datasets that consider, for their ground truth annotations, visual ambiguities as only related to global object symmetries, whereas they should be defined per-image to account for the camera viewpoint. We thus first propose an automatic method to re-annotate those datasets with a 6D pose distribution specific to each image, taking into account the object surface visibility in the image to correctly determine the visual ambiguities. Second, given this improved ground truth, we re-evaluate the state-of-the-art single pose methods and show that this greatly modifies the ranking of these methods. Third, as some recent works focus on estimating the complete set of solutions, we derive a precision/recall formulation to evaluate them against our image-wise distribution ground truth, making it the first benchmark for pose distribution methods on real images.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes