CVLGDec 12, 2019

Local Context Normalization: Revisiting Local Normalization

arXiv:1912.05845v329 citations
Originality Incremental advance
AI Analysis

This addresses a specific issue in computer vision where large or variable-sized images degrade performance with standard normalization, offering a domain-specific improvement for tasks like satellite imagery analysis.

The paper tackles the problem of normalization layers washing out important spatial signals in vision applications by proposing Local Context Normalization (LCN), which normalizes features based on local windows and group information, resulting in improved performance over existing methods like Batch Normalization and Group Normalization in object detection, semantic segmentation, and instance segmentation across benchmark datasets.

Normalization layers have been shown to improve convergence in deep neural networks, and even add useful inductive biases. In many vision applications the local spatial context of the features is important, but most common normalization schemes including Group Normalization (GN), Instance Normalization (IN), and Layer Normalization (LN) normalize over the entire spatial dimension of a feature. This can wash out important signals and degrade performance. For example, in applications that use satellite imagery, input images can be arbitrarily large; consequently, it is nonsensical to normalize over the entire area. Positional Normalization (PN), on the other hand, only normalizes over a single spatial position at a time. A natural compromise is to normalize features by local context, while also taking into account group level information. In this paper, we propose Local Context Normalization (LCN): a normalization layer where every feature is normalized based on a window around it and the filters in its group. We propose an algorithmic solution to make LCN efficient for arbitrary window sizes, even if every point in the image has a unique window. LCN outperforms its Batch Normalization (BN), GN, IN, and LN counterparts for object detection, semantic segmentation, and instance segmentation applications in several benchmark datasets, while keeping performance independent of the batch size and facilitating transfer learning.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes