LG SOFT ITApr 15, 2022

The Distributed Information Bottleneck reveals the explanatory structure of complex systems

arXiv:2204.07576v17.87 citationsh-index: 105

Originality Highly original

AI Analysis

This provides a principled framework for interpretable deep learning in science, addressing a key bottleneck for researchers in fields like applied mathematics and physics.

The paper tackles the problem of interpretability in deep learning for scientific applications by introducing the Distributed Information Bottleneck, which distributes bottlenecks across input components to deconstruct relationships into meaningful approximations. The result demonstrates utility in systems like Boolean circuits and sheared glasses, enabling interpretable analysis without custom datasets or architectures.

The fruits of science are relationships made comprehensible, often by way of approximation. While deep learning is an extremely powerful way to find relationships in data, its use in science has been hindered by the difficulty of understanding the learned relationships. The Information Bottleneck (IB) is an information theoretic framework for understanding a relationship between an input and an output in terms of a trade-off between the fidelity and complexity of approximations to the relationship. Here we show that a crucial modification -- distributing bottlenecks across multiple components of the input -- opens fundamentally new avenues for interpretable deep learning in science. The Distributed Information Bottleneck throttles the downstream complexity of interactions between the components of the input, deconstructing a relationship into meaningful approximations found through deep learning without requiring custom-made datasets or neural network architectures. Applied to a complex system, the approximations illuminate aspects of the system's nature by restricting -- and monitoring -- the information about different components incorporated into the approximation. We demonstrate the Distributed IB's explanatory utility in systems drawn from applied mathematics and condensed matter physics. In the former, we deconstruct a Boolean circuit into approximations that isolate the most informative subsets of input components without requiring exhaustive search. In the latter, we localize information about future plastic rearrangement in the static structure of a sheared glass, and find the information to be more or less diffuse depending on the system's preparation. By way of a principled scheme of approximations, the Distributed IB brings much-needed interpretability to deep learning and enables unprecedented analysis of information flow through a system.

View on arXiv PDF

Similar