CVMar 15, 2022

SATS: Self-Attention Transfer for Continual Semantic Segmentation

arXiv:2203.07667v348 citationsh-index: 24
Originality Incremental advance
AI Analysis

This addresses the problem of forgetting old classes when learning new ones in semantic segmentation for intelligent systems, representing an incremental improvement over existing knowledge distillation strategies.

The paper tackles catastrophic forgetting in continual semantic segmentation by proposing a self-attention transfer method that captures relationships between image elements, and it shows significant performance improvements over state-of-the-art solutions on multiple benchmarks.

Continually learning to segment more and more types of image regions is a desired capability for many intelligent systems. However, such continual semantic segmentation suffers from the same catastrophic forgetting issue as in continual classification learning. While multiple knowledge distillation strategies originally for continual classification have been well adapted to continual semantic segmentation, they only consider transferring old knowledge based on the outputs from one or more layers of deep fully convolutional networks. Different from existing solutions, this study proposes to transfer a new type of information relevant to knowledge, i.e. the relationships between elements (Eg. pixels or small local regions) within each image which can capture both within-class and between-class knowledge. The relationship information can be effectively obtained from the self-attention maps in a Transformer-style segmentation model. Considering that pixels belonging to the same class in each image often share similar visual properties, a class-specific region pooling is applied to provide more efficient relationship information for knowledge transfer. Extensive evaluations on multiple public benchmarks support that the proposed self-attention transfer method can further effectively alleviate the catastrophic forgetting issue, and its flexible combination with one or more widely adopted strategies significantly outperforms state-of-the-art solutions.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes