Collecting and Annotating the Large Continuous Action Dataset
This provides a new dataset to advance action-recognition research, though it is incremental as it builds on existing dataset efforts.
The authors introduced the LCA dataset for action recognition, which is larger and contains overlapping actions in streaming video with uniform backgrounds, and found that state-of-the-art methods perform poorly on it, indicating it will be a challenging benchmark.
We make available to the community a new dataset to support action-recognition research. This dataset is different from prior datasets in several key ways. It is significantly larger. It contains streaming video with long segments containing multiple action occurrences that often overlap in space and/or time. All actions were filmed in the same collection of backgrounds so that background gives little clue as to action class. We had five humans replicate the annotation of temporal extent of action occurrences labeled with their class and measured a surprisingly low level of intercoder agreement. A baseline experiment shows that recent state-of-the-art methods perform poorly on this dataset. This suggests that this will be a challenging dataset to foster advances in action-recognition research. This manuscript serves to describe the novel content and characteristics of the LCA dataset, present the design decisions made when filming the dataset, and document the novel methods employed to annotate the dataset.