Highway Driving Dataset for Semantic Video Segmentation
This work addresses the need for video-based datasets in autonomous driving, but it is incremental as it builds on existing semantic segmentation methods.
The authors tackled the lack of real-time semantic segmentation datasets for autonomous driving by introducing the Highway Driving dataset, a densely annotated video benchmark with 20 sequences at 30Hz, and proposed a baseline algorithm using temporal correlation.
Scene understanding is an essential technique in semantic segmentation. Although there exist several datasets that can be used for semantic segmentation, they are mainly focused on semantic image segmentation with large deep neural networks. Therefore, these networks are not useful for real time applications, especially in autonomous driving systems. In order to solve this problem, we make two contributions to semantic segmentation task. The first contribution is that we introduce the semantic video dataset, the Highway Driving dataset, which is a densely annotated benchmark for a semantic video segmentation task. The Highway Driving dataset consists of 20 video sequences having a 30Hz frame rate, and every frame is densely annotated. Secondly, we propose a baseline algorithm that utilizes a temporal correlation. Together with our attempt to analyze the temporal correlation, we expect the Highway Driving dataset to encourage research on semantic video segmentation.