LGMLOct 5, 2020

DEMI: Discriminative Estimator of Mutual Information

arXiv:2010.01766v28 citationsHas Code
AI Analysis

This addresses a fundamental bottleneck in machine learning for researchers and practitioners dealing with high-dimensional data, offering a more reliable estimator compared to existing variational methods.

The paper tackles the challenging problem of estimating mutual information for high-dimensional continuous data by introducing a classifier-based method that avoids variational bounds, achieving high accuracy and demonstrating advantages in representation learning.

Estimating mutual information between continuous random variables is often intractable and extremely challenging for high-dimensional data. Recent progress has leveraged neural networks to optimize variational lower bounds on mutual information. Although showing promise for this difficult problem, the variational methods have been theoretically and empirically proven to have serious statistical limitations: 1) many methods struggle to produce accurate estimates when the underlying mutual information is either low or high; 2) the resulting estimators may suffer from high variance. Our approach is based on training a classifier that provides the probability that a data sample pair is drawn from the joint distribution rather than from the product of its marginal distributions. Moreover, we establish a direct connection between mutual information and the average log odds estimate produced by the classifier on a test set, leading to a simple and accurate estimator of mutual information. We show theoretically that our method and other variational approaches are equivalent when they achieve their optimum, while our method sidesteps the variational bound. Empirical results demonstrate high accuracy of our approach and the advantages of our estimator in the context of representation learning. Our demo is available at https://github.com/RayRuizhiLiao/demi_mi_estimator.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes