CVJun 6, 2018

Action4D: Real-time Action Recognition in the Crowd and Clutter

arXiv:1806.02424v11 citations
AI Analysis

This addresses the problem of reliable action recognition in real-world settings like smart homes and factories, though it appears incremental as it builds on existing action recognition with new methods for 4D data.

The paper tackles real-time action recognition for multiple people in crowded and cluttered environments by proposing Action4D, a method using holistic 4D scene scans and a novel deep neural network, achieving state-of-the-art results with fast and accurate performance.

Recognizing every person's action in a crowded and cluttered environment is a challenging task. In this paper, we propose a real-time action recognition method, Action4D, which gives reliable and accurate results in the real-world settings. We propose to tackle the action recognition problem using a holistic 4D "scan" of a cluttered scene to include every detail about the people and environment. Recognizing multiple people's actions in the cluttered 4D representation is a new problem. In this paper, we propose novel methods to solve this problem. We propose a new method to track people in 4D, which can reliably detect and follow each person in real time. We propose a new deep neural network, the Action4D-Net, to recognize the action of each tracked person. The Action4D-Net's novel structure uses both the global feature and the focused attention to achieve state-of-the-art result. Our real-time method is invariant to camera view angles, resistant to clutter and able to handle crowd. The experimental results show that the proposed method is fast, reliable and accurate. Our method paves the way to action recognition in the real-world applications and is ready to be deployed to enable smart homes, smart factories and smart stores.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes