ML LGFeb 29, 2020

Differentiating through the Fréchet Mean

Aaron Lou, Isay Katsman, Qingxuan Jiang, Serge Belongie, Ser-Nam Lim, Christopher De Sa

arXiv:2003.00335v418.884 citationsHas Code

Originality Incremental advance

AI Analysis

This solves the challenge of applying geometric means in deep learning for hyperbolic spaces, with incremental improvements in specific domains like graph networks.

The paper tackled the problem of differentiating through the Fréchet mean on Riemannian manifolds, enabling its integration into hyperbolic neural networks, resulting in state-of-the-art performance on high-hyperbolicity datasets and improved batch normalization.

Recent advances in deep representation learning on Riemannian manifolds extend classical deep learning operations to better capture the geometry of the manifold. One possible extension is the Fréchet mean, the generalization of the Euclidean mean; however, it has been difficult to apply because it lacks a closed form with an easily computable derivative. In this paper, we show how to differentiate through the Fréchet mean for arbitrary Riemannian manifolds. Then, focusing on hyperbolic space, we derive explicit gradient expressions and a fast, accurate, and hyperparameter-free Fréchet mean solver. This fully integrates the Fréchet mean into the hyperbolic neural network pipeline. To demonstrate this integration, we present two case studies. First, we apply our Fréchet mean to the existing Hyperbolic Graph Convolutional Network, replacing its projected aggregation to obtain state-of-the-art results on datasets with high hyperbolicity. Second, to demonstrate the Fréchet mean's capacity to generalize Euclidean neural network operations, we develop a hyperbolic batch normalization method that gives an improvement parallel to the one observed in the Euclidean setting.

View on arXiv PDF Code

Similar