LGAIOct 29, 2025

IBNorm: Information-Bottleneck Inspired Normalization for Representation Learning

arXiv:2510.25262v1h-index: 2
Originality Highly original
AI Analysis

This addresses a foundational issue in deep learning normalization for representation learning, offering a novel approach that improves model performance across domains.

The paper tackled the problem of existing normalization methods being variance-centric without controlling task-relevant information capture, proposing IBNorm based on the Information Bottleneck principle to preserve predictive information while suppressing nuisance variability, resulting in consistent outperformance over BatchNorm, LayerNorm, and RMSNorm across large-scale language and vision models.

Normalization is fundamental to deep learning, but existing approaches such as BatchNorm, LayerNorm, and RMSNorm are variance-centric by enforcing zero mean and unit variance, stabilizing training without controlling how representations capture task-relevant information. We propose IB-Inspired Normalization (IBNorm), a simple yet powerful family of methods grounded in the Information Bottleneck principle. IBNorm introduces bounded compression operations that encourage embeddings to preserve predictive information while suppressing nuisance variability, yielding more informative representations while retaining the stability and compatibility of standard normalization. Theoretically, we prove that IBNorm achieves a higher IB value and tighter generalization bounds than variance-centric methods. Empirically, IBNorm consistently outperforms BatchNorm, LayerNorm, and RMSNorm across large-scale language models (LLaMA, GPT-2) and vision models (ResNet, ViT), with mutual information analysis confirming superior information bottleneck behavior. Code will be released publicly.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes