CVApr 16, 2022

Interactiveness Field in Human-Object Interactions

arXiv:2204.07718v163 citationsh-index: 72Has Code
Originality Incremental advance
AI Analysis

This work addresses a core bottleneck in activity understanding for computer vision applications, offering an incremental but effective enhancement to existing HOI detection methods.

The paper tackles the challenge of discovering interactive human-object pairs in Human-Object Interaction (HOI) detection by introducing an 'interactiveness field' based on a bimodal prior, resulting in significant performance improvements on widely-used benchmarks.

Human-Object Interaction (HOI) detection plays a core role in activity understanding. Though recent two/one-stage methods have achieved impressive results, as an essential step, discovering interactive human-object pairs remains challenging. Both one/two-stage methods fail to effectively extract interactive pairs instead of generating redundant negative pairs. In this work, we introduce a previously overlooked interactiveness bimodal prior: given an object in an image, after pairing it with the humans, the generated pairs are either mostly non-interactive, or mostly interactive, with the former more frequent than the latter. Based on this interactiveness bimodal prior we propose the "interactiveness field". To make the learned field compatible with real HOI image considerations, we propose new energy constraints based on the cardinality and difference in the inherent "interactiveness field" underlying interactive versus non-interactive pairs. Consequently, our method can detect more precise pairs and thus significantly boost HOI detection performance, which is validated on widely-used benchmarks where we achieve decent improvements over state-of-the-arts. Our code is available at https://github.com/Foruck/Interactiveness-Field.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes