CVAISep 19, 2023

An Empirical Study of Attention Networks for Semantic Segmentation

arXiv:2309.10217v11 citationsh-index: 3
Originality Synthesis-oriented
AI Analysis

It addresses practical engineering needs by systematically evaluating attention networks beyond just mIoU comparisons, though it is incremental in nature.

This paper analyzes attention networks for semantic segmentation by comparing their computational complexity (FLOPs, memory) and performance across categories, identifying suitable application scenarios and providing construction guidelines.

Semantic segmentation is a vital problem in computer vision. Recently, a common solution to semantic segmentation is the end-to-end convolution neural network, which is much more accurate than traditional methods.Recently, the decoders based on attention achieve state-of-the-art (SOTA) performance on various datasets. But these networks always are compared with the mIoU of previous SOTA networks to prove their superiority and ignore their characteristics without considering the computation complexity and precision in various categories, which is essential for engineering applications. Besides, the methods to analyze the FLOPs and memory are not consistent between different networks, which makes the comparison hard to be utilized. What's more, various methods utilize attention in semantic segmentation, but the conclusion of these methods is lacking. This paper first conducts experiments to analyze their computation complexity and compare their performance. Then it summarizes suitable scenes for these networks and concludes key points that should be concerned when constructing an attention network. Last it points out some future directions of the attention network.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes