CVApr 21, 2020

Multi-view Self-Constructing Graph Convolutional Networks with Adaptive Class Weighting Loss for Semantic Segmentation

arXiv:2004.10327v141 citationsHas Code
AI Analysis

This work addresses class imbalance and rotational invariance in semantic segmentation for agriculture, but it is incremental as it builds on existing self-constructing graph modules.

The paper tackles semantic segmentation in airborne images by proposing a multi-view self-constructing graph convolutional network with an adaptive class weighting loss, achieving competitive results of 0.547 mIoU with reduced parameters and computational cost.

We propose a novel architecture called the Multi-view Self-Constructing Graph Convolutional Networks (MSCG-Net) for semantic segmentation. Building on the recently proposed Self-Constructing Graph (SCG) module, which makes use of learnable latent variables to self-construct the underlying graphs directly from the input features without relying on manually built prior knowledge graphs, we leverage multiple views in order to explicitly exploit the rotational invariance in airborne images. We further develop an adaptive class weighting loss to address the class imbalance. We demonstrate the effectiveness and flexibility of the proposed method on the Agriculture-Vision challenge dataset and our model achieves very competitive results (0.547 mIoU) with much fewer parameters and at a lower computational cost compared to related pure-CNN based work. Code will be available at: github.com/samleoqh/MSCG-Net

Code Implementations2 repos
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes