CVIVDec 23, 2019

Analysis of the hands in egocentric vision: A survey

arXiv:1912.10867v396 citations
Originality Synthesis-oriented
AI Analysis

It synthesizes existing research for researchers in computer vision and human-computer interaction, but is incremental as it does not introduce new methods.

This survey reviews literature on analyzing hands in egocentric vision, categorizing approaches into localization, interpretation, and application, and provides a list of datasets with hand-based annotations.

Egocentric vision (a.k.a. first-person vision - FPV) applications have thrived over the past few years, thanks to the availability of affordable wearable cameras and large annotated datasets. The position of the wearable camera (usually mounted on the head) allows recording exactly what the camera wearers have in front of them, in particular hands and manipulated objects. This intrinsic advantage enables the study of the hands from multiple perspectives: localizing hands and their parts within the images; understanding what actions and activities the hands are involved in; and developing human-computer interfaces that rely on hand gestures. In this survey, we review the literature that focuses on the hands using egocentric vision, categorizing the existing approaches into: localization (where are the hands or parts of them?); interpretation (what are the hands doing?); and application (e.g., systems that used egocentric hand cues for solving a specific problem). Moreover, a list of the most prominent datasets with hand-based annotations is provided.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes