LGJul 25, 2022

Effective and Interpretable Information Aggregation with Capacity Networks

arXiv:2207.12013v11.8h-index: 8

Originality Incremental advance

AI Analysis

This addresses the need for more effective and interpretable models in multiple instance learning, though it appears incremental as it builds on existing encoder-decoder strategies.

The paper tackles the problem of aggregating information from multiple instances in multiple instance learning by proposing Capacity networks, which improve over encoder-decoder architectures and provide interpretable intermediate results.

How to aggregate information from multiple instances is a key question multiple instance learning. Prior neural models implement different variants of the well-known encoder-decoder strategy according to which all input features are encoded a single, high-dimensional embedding which is then decoded to generate an output. In this work, inspired by Choquet capacities, we propose Capacity networks. Unlike encoder-decoders, Capacity networks generate multiple interpretable intermediate results which can be aggregated in a semantically meaningful space to obtain the final output. Our experiments show that implementing this simple inductive bias leads to improvements over different encoder-decoder architectures in a wide range of experiments. Moreover, the interpretable intermediate results make Capacity networks interpretable by design, which allows a semantically meaningful inspection, evaluation, and regularization of the network internals.

View on arXiv PDF

Similar