MLAILGMay 18

Markov Chain Decoders Overcome the Heavy-Tail Limitations of Lipschitz Generative Models

arXiv:2605.1893137.0
Predicted impact top 56% in ML · last 90 daysOriginality Incremental advance
AI Analysis

Provides a principled solution to a fundamental limitation of VAEs for heavy-tailed data, which is important for risk modeling and network traffic analysis.

Heavy-tailed distributions challenge deep generative models; the authors prove that Gaussian decoders with Lipschitz constraints cannot generate heavy tails. Replacing the Gaussian decoder with a Phase-Type distribution reduces tail Kolmogorov-Smirnov distance by up to 6x and extreme quantile error by up to 10x on synthetic Pareto data.

Heavy-tailed distributions are prevalent in performance evaluation, network traffic, and risk modeling. This behavior poses a fundamental challenge for modern deep generative models. Standard Variational Autoencoders (VAEs) employ Gaussian decoder likelihoods and Lipschitz-constrained neural networks, a combination that is structurally incapable of producing heavy-tailed outputs: the Gaussian tail decays exponentially, and Lipschitz continuity prevents the decoder from amplifying rare events from the latent space input to sufficiently overcome this decay. We provide both a theoretical characterization of this limitation and a controlled empirical demonstration using synthetic Pareto data across a grid of tail indices $α$ $\in$ {2, 3, 5, 30} and dimensions d $\in$ {1, 5, 10}. As a solution, we replace the Gaussian decoder with a Phase-Type (PH) distribution based on Markov chains, while keeping the encoder, latent space, and training procedure identical. PH distributions allow for arbitrarily precise approximations of any positive-valued distributions, including heavy-tailed families. Experiments showed that the PH-based model reduces tail Kolmogorov-Smirnov distance by up to x6 and extreme quantile error by up to x10 compared to the Gaussian baseline for heavy-tailed data. These results demonstrate that integrating Markov chain-based distributions into the decoder of a generative model institutes a principled and practically effective solution to the heavy-tail generation problem.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes