CVMay 21, 2018

Batch-Instance Normalization for Adaptively Style-Invariant Neural Networks

arXiv:1805.07925v3241 citations
Originality Incremental advance
AI Analysis

This addresses style invariance for general visual recognition tasks, offering an incremental improvement over existing normalization methods.

The paper tackles the problem of visual style variability in image recognition by introducing Batch-Instance Normalization (BIN), which selectively normalizes disturbing styles while preserving useful ones, leading to improved recognition performance in various scenarios.

Real-world image recognition is often challenged by the variability of visual styles including object textures, lighting conditions, filter effects, etc. Although these variations have been deemed to be implicitly handled by more training data and deeper networks, recent advances in image style transfer suggest that it is also possible to explicitly manipulate the style information. Extending this idea to general visual recognition problems, we present Batch-Instance Normalization (BIN) to explicitly normalize unnecessary styles from images. Considering certain style features play an essential role in discriminative tasks, BIN learns to selectively normalize only disturbing styles while preserving useful styles. The proposed normalization module is easily incorporated into existing network architectures such as Residual Networks, and surprisingly improves the recognition performance in various scenarios. Furthermore, experiments verify that BIN effectively adapts to completely different tasks like object classification and style transfer, by controlling the trade-off between preserving and removing style variations. BIN can be implemented with only a few lines of code using popular deep learning frameworks.

Code Implementations3 repos
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes