LGJun 28, 2021

Hyperbolic Busemann Learning with Ideal Prototypes

arXiv:2106.14472v259 citations
Originality Incremental advance
AI Analysis

This work addresses a limitation in hyperbolic representation learning for classification, offering a method that does not need privileged label information, though it is incremental as it builds on existing prototype-based approaches.

The paper tackles the problem of hyperbolic prototype classification requiring prior label knowledge by positioning prototypes on the ideal boundary of the Poincaré ball, resulting in outperformance over recent hyperspherical and hyperbolic prototype approaches.

Hyperbolic space has become a popular choice of manifold for representation learning of various datatypes from tree-like structures and text to graphs. Building on the success of deep learning with prototypes in Euclidean and hyperspherical spaces, a few recent works have proposed hyperbolic prototypes for classification. Such approaches enable effective learning in low-dimensional output spaces and can exploit hierarchical relations amongst classes, but require privileged information about class labels to position the hyperbolic prototypes. In this work, we propose Hyperbolic Busemann Learning. The main idea behind our approach is to position prototypes on the ideal boundary of the Poincaré ball, which does not require prior label knowledge. To be able to compute proximities to ideal prototypes, we introduce the penalised Busemann loss. We provide theory supporting the use of ideal prototypes and the proposed loss by proving its equivalence to logistic regression in the one-dimensional case. Empirically, we show that our approach provides a natural interpretation of classification confidence, while outperforming recent hyperspherical and hyperbolic prototype approaches.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes