LGMLApr 7, 2020

Capsule Networks -- A Probabilistic Perspective

arXiv:2004.03553v36 citations
AI Analysis

This work provides a theoretical framework for capsule networks, which could improve robustness in computer vision tasks, but it appears incremental as it builds on existing capsule assumptions.

The paper tackles the problem of making capsule networks robust to viewpoint changes by proposing a probabilistic generative model that separates generative and inference components, and demonstrates the use of test-time optimization to address amortized inference issues.

'Capsule' models try to explicitly represent the poses of objects, enforcing a linear relationship between an object's pose and that of its constituent parts. This modelling assumption should lead to robustness to viewpoint changes since the sub-object/super-object relationships are invariant to the poses of the object. We describe a probabilistic generative model which encodes such capsule assumptions, clearly separating the generative parts of the model from the inference mechanisms. With a variational bound we explore the properties of the generative model independently of the approximate inference scheme, and gain insights into failures of the capsule assumptions and inference amortisation. We experimentally demonstrate the applicability of our unified objective, and demonstrate the use of test time optimisation to solve problems inherent to amortised inference in our model.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes