LGOct 20, 2023

Fast hyperboloid decision tree algorithms

arXiv:2310.13841v27 citationsh-index: 53
Originality Incremental advance
AI Analysis

This work addresses scalability and efficiency issues in hyperbolic machine learning, providing practical tools for researchers and practitioners working with hierarchical data.

The paper tackles computational challenges in hyperbolic classifiers by introducing hyperDT, a hyperbolic decision tree algorithm that eliminates the need for Riemannian optimization and unstable operations, and extends it to hyperRF, a hyperbolic random forest model. The models demonstrate superior performance across diverse datasets, offering a fast and accurate toolkit for hyperbolic data analysis.

Hyperbolic geometry is gaining traction in machine learning for its effectiveness at capturing hierarchical structures in real-world data. Hyperbolic spaces, where neighborhoods grow exponentially, offer substantial advantages and consistently deliver state-of-the-art results across diverse applications. However, hyperbolic classifiers often grapple with computational challenges. Methods reliant on Riemannian optimization frequently exhibit sluggishness, stemming from the increased computational demands of operations on Riemannian manifolds. In response to these challenges, we present hyperDT, a novel extension of decision tree algorithms into hyperbolic space. Crucially, hyperDT eliminates the need for computationally intensive Riemannian optimization, numerically unstable exponential and logarithmic maps, or pairwise comparisons between points by leveraging inner products to adapt Euclidean decision tree algorithms to hyperbolic space. Our approach is conceptually straightforward and maintains constant-time decision complexity while mitigating the scalability issues inherent in high-dimensional Euclidean spaces. Building upon hyperDT we introduce hyperRF, a hyperbolic random forest model. Extensive benchmarking across diverse datasets underscores the superior performance of these models, providing a swift, precise, accurate, and user-friendly toolkit for hyperbolic data analysis.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes