A Short Note on the Kinetics-700 Human Action Dataset
This work provides an incremental update to a dataset for human action recognition research.
The authors extended the DeepMind Kinetics human action dataset from 600 to 700 classes, each with at least 600 video clips from YouTube, and provided baseline results using the I3D neural network architecture.
We describe an extension of the DeepMind Kinetics human action dataset from 600 classes to 700 classes, where for each class there are at least 600 video clips from different YouTube videos. This paper details the changes introduced for this new release of the dataset, and includes a comprehensive set of statistics as well as baseline results using the I3D neural network architecture.