CVMar 6, 2021

Learning Statistical Texture for Semantic Segmentation

arXiv:2103.04133v1148 citations
Originality Incremental advance
AI Analysis

This work addresses the challenge of improving boundary precision in semantic segmentation for computer vision applications, representing an incremental advancement by enhancing texture feature utilization.

The paper tackles the problem of semantic segmentation by proposing a Statistical Texture Learning Network (STLNet) that leverages global statistical knowledge from low-level texture features, achieving state-of-the-art performance on Cityscapes, PASCAL Context, and ADE20K benchmarks.

Existing semantic segmentation works mainly focus on learning the contextual information in high-level semantic features with CNNs. In order to maintain a precise boundary, low-level texture features are directly skip-connected into the deeper layers. Nevertheless, texture features are not only about local structure, but also include global statistical knowledge of the input image. In this paper, we fully take advantages of the low-level texture features and propose a novel Statistical Texture Learning Network (STLNet) for semantic segmentation. For the first time, STLNet analyzes the distribution of low level information and efficiently utilizes them for the task. Specifically, a novel Quantization and Counting Operator (QCO) is designed to describe the texture information in a statistical manner. Based on QCO, two modules are introduced: (1) Texture Enhance Module (TEM), to capture texture-related information and enhance the texture details; (2) Pyramid Texture Feature Extraction Module (PTFEM), to effectively extract the statistical texture features from multiple scales. Through extensive experiments, we show that the proposed STLNet achieves state-of-the-art performance on three semantic segmentation benchmarks: Cityscapes, PASCAL Context and ADE20K.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes