CVAIFeb 1, 2022

Dilated Continuous Random Field for Semantic Segmentation

arXiv:2202.00162v1
Originality Incremental advance
AI Analysis

This work addresses semantic segmentation refinement for computer vision applications, presenting an incremental improvement over existing CRF methods.

The paper tackles the refinement of semantic segmentation by relaxing the hard constraint of mean field approximation in Continuous Random Fields (CRF) through a global optimization with a dilated sparse convolution module, achieving superior results on the suction dataset compared to other CRF-based approaches.

Mean field approximation methodology has laid the foundation of modern Continuous Random Field (CRF) based solutions for the refinement of semantic segmentation. In this paper, we propose to relax the hard constraint of mean field approximation - minimizing the energy term of each node from probabilistic graphical model, by a global optimization with the proposed dilated sparse convolution module (DSConv). In addition, adaptive global average-pooling and adaptive global max-pooling are implemented as replacements of fully connected layers. In order to integrate DSConv, we design an end-to-end, time-efficient DilatedCRF pipeline. The unary energy term is derived either from pre-softmax and post-softmax features, or the predicted affordance map using a conventional classifier, making it easier to implement DilatedCRF for varieties of classifiers. We also present superior experimental results of proposed approach on the suction dataset comparing to other CRF-based approaches.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes