CVJun 1

Diagnosis of Human Object Interaction Detectors for Real World Educational Applications

arXiv:2606.0278922.0
Predicted impact top 90% in CV · last 90 daysOriginality Incremental advance
AI Analysis

This work addresses the performance degradation of HOI detectors in real-world educational environments, offering a targeted refinement strategy for domain adaptation.

The authors introduce a diagnosis-driven framework for human-object interaction (HOI) detection in real-world educational settings, improving the macro-F1 score of a pretrained CDN model from 48.6 to 90.2 on a CCATT medical training dataset.

Human-object interaction (HOI) recognition is critical for automatically analyzing student behavior in complex educational environments. Although state-of-the-art (SOTA) HOI detectors perform well on benchmark datasets, their performance often degrades when deployed in real-world training environments due to domain-specific objects, occlusions, and complex visual conditions. In this paper, we introduce a diagnosis-driven framework that integrates a triplet-level HOI error taxonomy with error-factor attribution analysis for real-world educational video data. We study this problem in the context of Critical Care Air Transport Team (CCATT) mixed-reality medical training. Based on an analysis of HOI failure modes and their causes, we develop a diagnosis-informed refinement strategy for adapting pretrained HOI models to the target domain. Experiments on the CCATT dataset show that this approach improves the macro-F1 score of a pretrained CDN model from 48.6 to 90.2 through targeted refinement guided by diagnosed error factors. These results highlight the value of detailed diagnostic analysis for informing targeted adaptation of HOI models in real-world educational environments.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes