CVJul 16, 2020

A New Look at Ghost Normalization

arXiv:2007.08554v19 citations
AI Analysis

This work addresses a specific optimization problem in neural networks for researchers and practitioners, offering incremental improvements over existing normalization techniques.

The paper tackles the performance degradation of batch normalization at small batch sizes by analyzing ghost normalization, uncovering a unique regularization source and introducing sequential normalization, which achieves superior performance on CIFAR-10 and CIFAR-100 datasets.

Batch normalization (BatchNorm) is an effective yet poorly understood technique for neural network optimization. It is often assumed that the degradation in BatchNorm performance to smaller batch sizes stems from it having to estimate layer statistics using smaller sample sizes. However, recently, Ghost normalization (GhostNorm), a variant of BatchNorm that explicitly uses smaller sample sizes for normalization, has been shown to improve upon BatchNorm in some datasets. Our contributions are: (i) we uncover a source of regularization that is unique to GhostNorm, and not simply an extension from BatchNorm, (ii) three types of GhostNorm implementations are described, two of which employ BatchNorm as the underlying normalization technique, (iii) by visualising the loss landscape of GhostNorm, we observe that GhostNorm consistently decreases the smoothness when compared to BatchNorm, (iv) we introduce Sequential Normalization (SeqNorm), and report superior performance over state-of-the-art methodologies on both CIFAR--10 and CIFAR--100 datasets.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes