CV · Sep 13, 2019

Dual Graph Convolutional Network for Semantic Segmentation

Li Zhang, Xiangtai Li, Anurag Arnab, Kuiyuan Yang, Yunhai Tong, Philip H. S. Torr

arXiv:1909.06121v325.5193 citations · Has Code

Originality: Highly original

AI Analysis

This paper addresses global context modeling in semantic segmentation, and is assessed as a novel method rather than an incremental improvement.

The paper tackles the problem of exploiting long-range contextual information for semantic segmentation by proposing a Dual Graph Convolutional Network (DGCNet) that models spatial and channel interdependencies, achieving state-of-the-art results with 82.0% mean IoU on Cityscapes and 53.7% mean IoU on Pascal Context.

Exploiting long-range contextual information is key for pixel-wise prediction tasks such as semantic segmentation. In contrast to previous work that uses multi-scale feature fusion or dilated convolutions, we propose a novel graph-convolutional network (GCN) to address this problem. Our Dual Graph Convolutional Network (DGCNet) models the global context of the input feature by modelling two orthogonal graphs in a single framework. The first component models spatial relationships between pixels in the image, whilst the second models interdependencies along the channel dimensions of the network's feature map. This is done efficiently by projecting the feature into a new, lower-dimensional space where all pairwise interactions can be modelled, before reprojecting into the original space. Our simple method provides substantial benefits over a strong baseline and achieves state-of-the-art results on both Cityscapes (82.0% mean IoU) and Pascal Context (53.7% mean IoU) datasets. Code and models are made available to foster any further research (https://github.com/lxtGH/GALD-DGCNet).
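
The project, interact, reproject idea described in the abstract can be illustrated with a minimal NumPy sketch. This is not the authors' implementation (see the linked repository for that); it is a generic illustration under stated assumptions. The function name `project_reason_reproject`, the weights `w_proj` and `w_graph`, the soft-assignment projection, the similarity-based adjacency, and the residual connection are all hypothetical choices made here for clarity.

```python
import numpy as np


def softmax(x, axis):
    """Numerically stable softmax along the given axis."""
    x = x - x.max(axis=axis, keepdims=True)
    e = np.exp(x)
    return e / e.sum(axis=axis, keepdims=True)


def project_reason_reproject(x, w_proj, w_graph):
    """Illustrative global-context block (hypothetical, not the paper's code).

    x:       (N, C) feature map flattened over N = H * W pixels.
    w_proj:  (C, K) assumed projection weights, with K much smaller than N.
    w_graph: (C, C) assumed weights applied to the node features.
    """
    # 1. Project: softly assign each of the N pixels to K graph nodes.
    assign = softmax(x @ w_proj, axis=1)              # (N, K)
    nodes = assign.T @ x                              # (K, C)

    # 2. Interact: model all K x K pairwise relations in the small space.
    #    The adjacency here is a row-normalised feature similarity (assumption).
    scale = np.sqrt(x.shape[1])
    adj = softmax(nodes @ nodes.T / scale, axis=1)    # (K, K)
    nodes = np.maximum(adj @ nodes @ w_graph, 0.0)    # graph conv + ReLU

    # 3. Reproject: map node features back to pixels, add a residual.
    return x + assign @ nodes                         # (N, C)


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    h, w, c, k = 16, 16, 32, 8
    x = rng.standard_normal((h * w, c))
    out = project_reason_reproject(
        x,
        0.1 * rng.standard_normal((c, k)),
        0.1 * rng.standard_normal((c, c)),
    )
    print(out.shape)
```

The point of the sketch is the cost: pairwise interactions are computed among K nodes (a K x K matrix) rather than among all N pixels (an N x N matrix), which is what makes modelling every pairwise relation affordable.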
