CV | May 7, 2019

LEDNet: A Lightweight Encoder-Decoder Network for Real-Time Semantic Segmentation

Yu Wang, Quan Zhou, Jia Liu, Jian Xiong, Guangwei Gao, Xiaofu Wu, Longin Jan Latecki

arXiv:1905.02423v3 | 20.2379 citations

Originality: Incremental advance

AI Analysis

The paper addresses efficient semantic segmentation for mobile applications, but its contribution is incremental because it builds on existing architectures such as ResNet.

The paper tackles the computational burden of CNNs for real-time semantic segmentation on mobile devices by proposing LEDNet, a lightweight encoder-decoder network with under 1M parameters that achieves over 71 FPS on a GTX 1080Ti GPU and state-of-the-art speed-accuracy trade-off on the CityScapes dataset.

The extensive computational burden limits the usage of CNNs in mobile devices for dense estimation tasks. In this paper, we present a lightweight network to address this problem, namely LEDNet, which employs an asymmetric encoder-decoder architecture for the task of real-time semantic segmentation. More specifically, the encoder adopts a ResNet as its backbone network, where two new operations, channel split and shuffle, are utilized in each residual block to greatly reduce computation cost while maintaining higher segmentation accuracy. In the decoder, an attention pyramid network (APN) is employed to further reduce the complexity of the entire network. Our model has fewer than 1M parameters, and is able to run at over 71 FPS on a single GTX 1080Ti GPU. The comprehensive experiments demonstrate that our approach achieves state-of-the-art results in terms of speed and accuracy trade-off on the CityScapes dataset.
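The abstract names channel split and channel shuffle but does not say how they are implemented. The sketch below illustrates the two operations in their common ShuffleNet-style form, which is an assumption rather than the authors' code: channels are stood in for by plain labels, and the function names `channel_split` and `channel_shuffle` are hypothetical. Split halves the channel axis so each branch processes fewer channels; shuffle interleaves the groups afterwards so information can mix between branches.

```python
from typing import List, Sequence, Tuple, TypeVar

T = TypeVar("T")


def channel_split(channels: Sequence[T]) -> Tuple[List[T], List[T]]:
    """Split the channel axis into two equal halves (assumes an even count)."""
    if len(channels) % 2 != 0:
        raise ValueError("channel count must be even")
    half = len(channels) // 2
    return list(channels[:half]), list(channels[half:])


def channel_shuffle(channels: Sequence[T], groups: int = 2) -> List[T]:
    """Interleave channels across groups.

    Equivalent to reshaping the channel axis to (groups, per_group),
    transposing to (per_group, groups), and flattening again.
    """
    if groups <= 0 or len(channels) % groups != 0:
        raise ValueError("channel count must be divisible by groups")
    per_group = len(channels) // groups
    return [
        channels[g * per_group + i]
        for i in range(per_group)
        for g in range(groups)
    ]


# Illustrative labels standing in for six feature-map channels.
channels = ["c0", "c1", "c2", "c3", "c4", "c5"]

# Each half would go through its own (cheaper) convolution branch.
left, right = channel_split(channels)

# After the branches are concatenated, shuffle mixes the two halves.
shuffled = channel_shuffle(left + right, groups=2)
print(left, right)
print(shuffled)
```

In a real network the same index pattern would be applied to the channel dimension of a feature tensor, with convolutional branches between the split and the concatenation.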
