CVAug 26, 2016

Fine Hand Segmentation using Convolutional Neural Networks

arXiv:1608.07454v117 citations
Originality Incremental advance
AI Analysis

This work addresses the need for precise hand segmentation in egocentric vision applications, such as augmented reality or human-computer interaction, with an incremental improvement in method design.

The paper tackles the problem of extracting accurate hand masks in egocentric views by proposing a novel deep learning architecture that avoids upscaling layers and maps convolutional features directly to segmentation masks, achieving real-time efficiency and accuracy on a new diverse dataset.

We propose a method for extracting very accurate masks of hands in egocentric views. Our method is based on a novel Deep Learning architecture: In contrast with current Deep Learning methods, we do not use upscaling layers applied to a low-dimensional representation of the input image. Instead, we extract features with convolutional layers and map them directly to a segmentation mask with a fully connected layer. We show that this approach, when applied in a multi-scale fashion, is both accurate and efficient enough for real-time. We demonstrate it on a new dataset made of images captured in various environments, from the outdoors to offices.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes