LG MLFeb 25, 2020

A Theory of Usable Information Under Computational Constraints

Yilun Xu, Shengjia Zhao, Jiaming Song, Russell Stewart, Stefano Ermon

arXiv:2002.10689v129.8213 citations

Originality Highly original

AI Analysis

This work addresses a foundational problem in machine learning and information theory by providing a more practical measure of information that can be created through computation, which is incremental but has broad implications for representation learning and high-dimensional data analysis.

The paper tackles the problem of reasoning about information in complex systems by proposing a new framework called predictive V-information, which extends Shannon's information theory to account for computational constraints and modeling power, and demonstrates its effectiveness in structure learning and fair representation learning.

We propose a new framework for reasoning about information in complex systems. Our foundation is based on a variational extension of Shannon's information theory that takes into account the modeling power and computational constraints of the observer. The resulting \emph{predictive $\mathcal{V}$-information} encompasses mutual information and other notions of informativeness such as the coefficient of determination. Unlike Shannon's mutual information and in violation of the data processing inequality, $\mathcal{V}$-information can be created through computation. This is consistent with deep neural networks extracting hierarchies of progressively more informative features in representation learning. Additionally, we show that by incorporating computational constraints, $\mathcal{V}$-information can be reliably estimated from data even in high dimensions with PAC-style guarantees. Empirically, we demonstrate predictive $\mathcal{V}$-information is more effective than mutual information for structure learning and fair representation learning.

View on arXiv PDF

Similar