CVMar 24, 2023

OPDMulti: Openable Part Detection for Multiple Objects

arXiv:2303.14087v146 citationsh-index: 46
Originality Incremental advance
AI Analysis

This work addresses a more realistic scenario for robotics and computer vision applications, though it is incremental in extending from single to multiple objects.

The paper tackles the problem of detecting openable parts in images with multiple objects, rather than the unrealistic single-object setting, and introduces OPDFormer, which significantly outperforms prior methods.

Openable part detection is the task of detecting the openable parts of an object in a single-view image, and predicting corresponding motion parameters. Prior work investigated the unrealistic setting where all input images only contain a single openable object. We generalize this task to scenes with multiple objects each potentially possessing openable parts, and create a corresponding dataset based on real-world scenes. We then address this more challenging scenario with OPDFormer: a part-aware transformer architecture. Our experiments show that the OPDFormer architecture significantly outperforms prior work. The more realistic multiple-object scenarios we investigated remain challenging for all methods, indicating opportunities for future work.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes