CVMay 7, 2019

LEDNet: A Lightweight Encoder-Decoder Network for Real-Time Semantic Segmentation

arXiv:1905.02423v3379 citations
Originality Incremental advance
AI Analysis

This addresses the problem of enabling efficient semantic segmentation for mobile applications, though it is incremental as it builds on existing architectures like ResNet.

The paper tackles the computational burden of CNNs for real-time semantic segmentation on mobile devices by proposing LEDNet, a lightweight encoder-decoder network with under 1M parameters that achieves over 71 FPS on a GTX 1080Ti GPU and state-of-the-art speed-accuracy trade-off on the CityScapes dataset.

The extensive computational burden limits the usage of CNNs in mobile devices for dense estimation tasks. In this paper, we present a lightweight network to address this problem,namely LEDNet, which employs an asymmetric encoder-decoder architecture for the task of real-time semantic segmentation.More specifically, the encoder adopts a ResNet as backbone network, where two new operations, channel split and shuffle, are utilized in each residual block to greatly reduce computation cost while maintaining higher segmentation accuracy. On the other hand, an attention pyramid network (APN) is employed in the decoder to further lighten the entire network complexity. Our model has less than 1M parameters,and is able to run at over 71 FPS in a single GTX 1080Ti GPU. The comprehensive experiments demonstrate that our approach achieves state-of-the-art results in terms of speed and accuracy trade-off on CityScapes dataset.

Code Implementations6 repos
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes