MLITLGOct 16, 2023

On the Properties and Estimation of Pointwise Mutual Information Profiles

ETH Zurich
arXiv:2310.10240v26 citationsh-index: 66
Originality Incremental advance
AI Analysis

This work addresses limitations in mutual information estimation for researchers in statistics and machine learning, offering a novel modeling approach with practical tools for analysis.

The paper tackled the problem of estimating pointwise mutual information profiles by analytically describing them for multivariate normal distributions and introducing Bend and Mix Models for accurate Monte Carlo estimation, showing applications in evaluating mutual information estimators and providing Bayesian estimates with uncertainty quantification.

The pointwise mutual information profile, or simply profile, is the distribution of pointwise mutual information for a given pair of random variables. One of its important properties is that its expected value is precisely the mutual information between these random variables. In this paper, we analytically describe the profiles of multivariate normal distributions and introduce a novel family of distributions, Bend and Mix Models, for which the profile can be accurately estimated using Monte Carlo methods. We then show how Bend and Mix Models can be used to study the limitations of existing mutual information estimators, investigate the behavior of neural critics used in variational estimators, and understand the effect of experimental outliers on mutual information estimation. Finally, we show how Bend and Mix Models can be used to obtain model-based Bayesian estimates of mutual information, suitable for problems with available domain expertise in which uncertainty quantification is necessary.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes