CVApr 3, 2018

Dynamic Video Segmentation Network

arXiv:1804.00931v2141 citations
Originality Incremental advance
AI Analysis

This work addresses the need for real-time video segmentation in applications like autonomous driving, offering a significant speed-accuracy trade-off, though it is incremental as it builds on existing segmentation and flow networks.

The paper tackles the problem of efficient semantic video segmentation by proposing DVSNet, which uses a decision network to adaptively route frame regions between a slow, accurate segmentation network and a fast, less accurate flow network, achieving up to 70.4% mIoU at 19.8 fps on Cityscapes and reducing computational workloads by up to 95%.

In this paper, we present a detailed design of dynamic video segmentation network (DVSNet) for fast and efficient semantic video segmentation. DVSNet consists of two convolutional neural networks: a segmentation network and a flow network. The former generates highly accurate semantic segmentations, but is deeper and slower. The latter is much faster than the former, but its output requires further processing to generate less accurate semantic segmentations. We explore the use of a decision network to adaptively assign different frame regions to different networks based on a metric called expected confidence score. Frame regions with a higher expected confidence score traverse the flow network. Frame regions with a lower expected confidence score have to pass through the segmentation network. We have extensively performed experiments on various configurations of DVSNet, and investigated a number of variants for the proposed decision network. The experimental results show that our DVSNet is able to achieve up to 70.4% mIoU at 19.8 fps on the Cityscape dataset. A high speed version of DVSNet is able to deliver an fps of 30.4 with 63.2% mIoU on the same dataset. DVSNet is also able to reduce up to 95% of the computational workloads.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes