CVSep 19, 2018

Detect, anticipate and generate: Semi-supervised recurrent latent variable models for human activity modeling

arXiv:1809.07075v1
Originality Incremental advance
AI Analysis

This work addresses the challenge of predicting human actions for robots, which is crucial for effective collaboration, but it appears incremental as it builds on existing latent variable and recurrent models.

The paper tackled the problem of modeling human behavior for human-robot collaboration by developing a semi-supervised variational recurrent neural network that models latent factors and observable features, incorporating discrete labels when available. The result showed that the model outperformed state-of-the-art approaches in activity and affordance detection and anticipation on the CAD-120 dataset.

Successful Human-Robot collaboration requires a predictive model of human behavior. The robot needs to be able to recognize current goals and actions and to predict future activities in a given context. However, the spatio-temporal sequence of human actions is difficult to model since latent factors such as intention, task, knowledge, intuition and preference determine the action choices of each individual. In this work we introduce semi-supervised variational recurrent neural networks which are able to a) model temporal distributions over latent factors and the observable feature space, b) incorporate discrete labels such as activity type when available, and c) generate possible future action sequences on both feature and label level. We evaluate our model on the Cornell Activity Dataset CAD-120 dataset. Our model outperforms state-of-the-art approaches in both activity and affordance detection and anticipation. Additionally, we show how samples of possible future action sequences are in line with past observations.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes