LGCVJun 8, 2023

Scaling Spherical CNNs

arXiv:2306.05420v129 citationsh-index: 77Has Code
Originality Highly original
AI Analysis

This work enables spherical CNNs to handle larger-scale problems in domains like molecular analysis and weather prediction, representing a significant scaling advancement.

The authors tackled the problem of scaling spherical CNNs for larger applications by improving model components, hardware implementation, and input representations, achieving state-of-the-art results on the QM9 molecular benchmark and competitive performance on weather forecasting tasks.

Spherical CNNs generalize CNNs to functions on the sphere, by using spherical convolutions as the main linear operation. The most accurate and efficient way to compute spherical convolutions is in the spectral domain (via the convolution theorem), which is still costlier than the usual planar convolutions. For this reason, applications of spherical CNNs have so far been limited to small problems that can be approached with low model capacity. In this work, we show how spherical CNNs can be scaled for much larger problems. To achieve this, we make critical improvements including novel variants of common model components, an implementation of core operations to exploit hardware accelerator characteristics, and application-specific input representations that exploit the properties of our model. Experiments show our larger spherical CNNs reach state-of-the-art on several targets of the QM9 molecular benchmark, which was previously dominated by equivariant graph neural networks, and achieve competitive performance on multiple weather forecasting tasks. Our code is available at https://github.com/google-research/spherical-cnn.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes