Feature Learning for Interaction Activity Recognition in RGBD Videos
This work addresses activity recognition for applications using RGBD cameras, but it is incremental as it builds on existing bag-of-visual-words and SVM approaches.
The paper tackles human activity recognition in RGBD videos by proposing a feature learning method based on 3D video data without domain knowledge, achieving results that outperform other techniques.
This paper proposes a human activity recognition method which is based on features learned from 3D video data without incorporating domain knowledge. The experiments on data collected by RGBD cameras produce results outperforming other techniques. Our feature encoding method follows the bag-of-visual-word model, then we use a SVM classifier to recognise the activities. We do not use skeleton or tracking information and the same technique is applied on color and depth data.