LG, IT, ML | Oct 14, 2019

Understanding the Limitations of Variational Mutual Information Estimators

arXiv:1910.06222v2 | 243 citations
Originality: Incremental advance
AI Analysis

This paper addresses practical limitations of mutual information estimation that matter to machine learning researchers, though it is an incremental advance that builds on existing variational approaches.

The paper tackles the problem of high variance and bias in variational mutual information estimators, shows that existing methods can fail basic self-consistency properties, and proposes a new estimator that improves bias-variance trade-offs on benchmark tasks.

Variational approaches based on neural networks are showing promise for estimating mutual information (MI) between high dimensional variables. However, they can be difficult to use in practice due to poorly understood bias-variance trade-offs. We theoretically show that, under some conditions, estimators such as MINE exhibit variance that could grow exponentially with the true amount of underlying MI. We also empirically demonstrate that existing estimators fail to satisfy basic self-consistency properties of MI, such as data processing and additivity under independence. Based on a unified perspective of variational approaches, we develop a new estimator that focuses on variance reduction. Empirical results demonstrate that our proposed estimator exhibits improved bias-variance trade-offs on standard benchmark tasks.
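The variance claim in the abstract can be illustrated with a toy experiment. The sketch below is not the paper's code or experimental setup. It assumes correlated Gaussian pairs, for which the true MI and the exact log density ratio are known in closed form, and it plugs that exact ratio into the Donsker-Varadhan bound (the objective underlying MINE) in place of a trained neural critic. All function names and the settings `dim=20`, `batch=64`, and `trials=200` are hypothetical choices made for illustration.

```python
import numpy as np


def rho_for_mi(mi, dim):
    """Per-coordinate correlation that gives a total MI of `mi` nats over `dim` coordinates."""
    return np.sqrt(1.0 - np.exp(-2.0 * mi / dim))


def sample_gaussian_pair(rng, n, dim, rho):
    """Draw n pairs (x, y); each coordinate pair is standard normal with correlation rho."""
    x = rng.standard_normal((n, dim))
    eps = rng.standard_normal((n, dim))
    y = rho * x + np.sqrt(1.0 - rho**2) * eps
    return x, y


def log_density_ratio(x, y, rho):
    """Exact log p(x, y) / (p(x) p(y)) for the Gaussian pair above, summed over coordinates."""
    per_dim = (-0.5 * np.log(1.0 - rho**2)
               - (rho**2 * (x**2 + y**2) - 2.0 * rho * x * y) / (2.0 * (1.0 - rho**2)))
    return per_dim.sum(axis=1)


def dv_estimate(rng, n, dim, rho):
    """One finite-sample Donsker-Varadhan estimate using the exact log ratio as the critic."""
    x, y = sample_gaussian_pair(rng, n, dim, rho)
    t_joint = log_density_ratio(x, y, rho)

    # A fresh, independent draw of y stands in for samples from p(x) p(y).
    _, y_indep = sample_gaussian_pair(rng, n, dim, rho)
    t_prod = log_density_ratio(x, y_indep, rho)

    # Numerically stable log of the batch mean of exp(critic).
    m = t_prod.max()
    log_mean_exp = m + np.log(np.mean(np.exp(t_prod - m)))
    return t_joint.mean() - log_mean_exp


def estimate_spread(mi, dim=20, batch=64, trials=200, seed=0):
    """Mean and standard deviation of the estimate over repeated batches."""
    rng = np.random.default_rng(seed)
    rho = rho_for_mi(mi, dim)
    estimates = np.array([dv_estimate(rng, batch, dim, rho) for _ in range(trials)])
    return estimates.mean(), estimates.std()


if __name__ == "__main__":
    for mi in (2.0, 6.0, 10.0):
        mean, std = estimate_spread(mi)
        print(f"true MI = {mi:4.1f}  mean estimate = {mean:6.2f}  std = {std:5.2f}")
```

Even with a perfect critic, the batch average of the exponentiated critic is dominated by rare large values once the true MI is high, so in this toy setting the spread of the estimate should widen as MI grows. A trained neural critic adds approximation error on top of this, which the sketch does not model.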

Code Implementations: 1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.
