CVJun 12, 2019

LAEO-Net: revisiting people Looking At Each Other in videos

arXiv:1906.05261v172 citations
Originality Incremental advance
AI Analysis

This work addresses the challenge of understanding social interactions in videos for applications like social network analysis, though it is incremental as it builds on existing LAEO detection methods.

The paper tackles the problem of detecting when people are Looking At Each Other (LAEO) in videos by proposing LAEO-Net, a deep CNN that uses spatio-temporal tracks and achieves state-of-the-art results on the TVHID-LAEO dataset, significantly outperforming previous approaches.

Capturing the `mutual gaze' of people is essential for understanding and interpreting the social interactions between them. To this end, this paper addresses the problem of detecting people Looking At Each Other (LAEO) in video sequences. For this purpose, we propose LAEO-Net, a new deep CNN for determining LAEO in videos. In contrast to previous works, LAEO-Net takes spatio-temporal tracks as input and reasons about the whole track. It consists of three branches, one for each character's tracked head and one for their relative position. Moreover, we introduce two new LAEO datasets: UCO-LAEO and AVA-LAEO. A thorough experimental evaluation demonstrates the ability of LAEONet to successfully determine if two people are LAEO and the temporal window where it happens. Our model achieves state-of-the-art results on the existing TVHID-LAEO video dataset, significantly outperforming previous approaches. Finally, we apply LAEO-Net to social network analysis, where we automatically infer the social relationship between pairs of people based on the frequency and duration that they LAEO.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes