CVMay 31, 2019

Deep Dual Relation Modeling for Egocentric Interaction Recognition

arXiv:1905.13586v133 citations
Originality Incremental advance
AI Analysis

This addresses a specific bottleneck in human-human interaction analysis for egocentric video applications, though it is incremental.

The paper tackles egocentric interaction recognition by modeling relations between the camera wearer and interactor, achieving state-of-the-art results on three datasets.

Egocentric interaction recognition aims to recognize the camera wearer's interactions with the interactor who faces the camera wearer in egocentric videos. In such a human-human interaction analysis problem, it is crucial to explore the relations between the camera wearer and the interactor. However, most existing works directly model the interactions as a whole and lack modeling the relations between the two interacting persons. To exploit the strong relations for egocentric interaction recognition, we introduce a dual relation modeling framework which learns to model the relations between the camera wearer and the interactor based on the individual action representations of the two persons. Specifically, we develop a novel interactive LSTM module, the key component of our framework, to explicitly model the relations between the two interacting persons based on their individual action representations, which are collaboratively learned with an interactor attention module and a global-local motion module. Experimental results on three egocentric interaction datasets show the effectiveness of our method and advantage over state-of-the-arts.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes