Historical Consensus: Preventing Posterior Collapse via Iterative Selection of Gaussian Mixture Priors

arXiv:2603.10935v256.4h-index: 21Has Code

Predicted impact top 58% in LG · last 90 daysOriginality Highly original

AI Analysis

This addresses a fundamental problem in VAEs for machine learning researchers, offering a novel solution that avoids architectural constraints.

The paper tackles posterior collapse in variational autoencoders by introducing Historical Consensus Training, which eliminates collapse through iterative selection of Gaussian mixture priors, achieving non-collapsed representations regardless of decoder variance or regularization strength.

Variational autoencoders (VAEs) frequently suffer from posterior collapse, where latent variables become uninformative and the approximate posterior degenerates to the prior. Recent work has characterized this phenomenon as a phase transition governed by the spectral properties of the data covariance matrix. In this paper, we propose a fundamentally different approach: instead of avoiding collapse through architectural constraints or hyperparameter tuning, we eliminate the possibility of collapse altogether by leveraging the multiplicity of Gaussian mixture model (GMM) clusterings. We introduce Historical Consensus Training, an iterative selection procedure that progressively refines a set of candidate GMM priors through alternating optimization and selection. The key insight is that models trained to satisfy multiple distinct clustering constraints develop a historical barrier -- a region in parameter space that remains stable even when subsequently trained with a single objective. We prove that this barrier excludes the collapsed solution, and demonstrate through extensive experiments on synthetic and real-world datasets that our method achieves non-collapsed representations regardless of decoder variance or regularization strength. Our approach requires no explicit stability conditions (e.g., $σ^{\prime 2} < λ_{\max}$) and works with arbitrary neural architectures. The code is available at https://github.com/tsegoochang/historical-consensus-vae.

View on arXiv PDF Code

Similar