LG RT ML | May 22, 2024

Deep Ridgelet Transform and Unified Universality Theorem for Deep and Shallow Joint-Group-Equivariant Machines

Sho Sonoda, Yuka Hashimoto, Isao Ishikawa, Masahiro Ikeda

arXiv:2405.13682v56.42 citations | h-index: 12 | ICML

Originality: Incremental advance

AI Analysis

This work offers a foundational unification of approximation theory for various neural network architectures, addressing a theoretical gap in machine learning.

The authors tackled the problem of unifying universal approximation theorems for deep and shallow neural networks by introducing joint-group-equivariant machines and a constructive theorem based on the ridgelet transform. They demonstrated the universality of four network types, including a new depth-2 network with quadratic forms, providing a common framework for understanding approximation schemes.

We present a constructive universal approximation theorem for learning machines equipped with joint-group-equivariant feature maps, called joint-equivariant machines, based on group representation theory. "Constructive" here means that the distribution of parameters is given in a closed-form expression known as the ridgelet transform. Joint-group-equivariance encompasses a broad class of feature maps that generalize classical group-equivariance. In particular, fully-connected networks are not group-equivariant but are joint-group-equivariant. Our main theorem also unifies the universal approximation theorems for both shallow and deep networks. Until this study, the universality of deep networks had been shown in a different manner from that of shallow networks, but our results discuss them on common ground. Now we can understand the approximation schemes of various learning machines in a unified manner. As applications, we show the constructive universal approximation properties of four examples: the depth-$n$ joint-equivariant machine, the depth-$n$ fully-connected network, the depth-$n$ group-convolutional network, and a new depth-$2$ network with quadratic forms whose universality was not previously known.
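The claim that fully-connected networks are joint-group-equivariant without being group-equivariant can be checked numerically for a single neuron. The sketch below is an illustration, not the paper's construction: it assumes the feature map $\phi(x,(a,b)) = \tanh(a \cdot x - b)$, an affine group element $g = (L, t)$ acting on inputs by $x \mapsto Lx + t$, a matching action on parameters $(a, b) \mapsto (L^{-\top} a,\ b + (L^{-\top} a) \cdot t)$, and a trivial action on the output, so joint equivariance reduces here to joint invariance. All function names are hypothetical.

```python
import numpy as np


def feature(x, a, b):
    """Single fully-connected neuron: tanh(a . x - b)."""
    return np.tanh(a @ x - b)


def act_on_input(L, t, x):
    """Affine action on the input: x -> L x + t."""
    return L @ x + t


def act_on_params(L, t, a, b):
    """Assumed matching action on parameters: (a, b) -> (L^{-T} a, b + (L^{-T} a) . t)."""
    a_new = np.linalg.solve(L.T, a)  # computes L^{-T} a without forming the inverse
    b_new = b + a_new @ t
    return a_new, b_new


def joint_invariance_gap(seed=0, m=3):
    """Return |phi(g.x, g.(a,b)) - phi(x, (a,b))| for one random draw."""
    rng = np.random.default_rng(seed)
    x = rng.normal(size=m)
    a = rng.normal(size=m)
    b = rng.normal()
    # Adding a multiple of the identity keeps L comfortably away from singular.
    L = rng.normal(size=(m, m)) + 3.0 * np.eye(m)
    t = rng.normal(size=m)

    a_g, b_g = act_on_params(L, t, a, b)
    joint = feature(act_on_input(L, t, x), a_g, b_g)  # act on input AND parameters
    plain = feature(x, a, b)                          # no action at all
    return abs(joint - plain)


def input_only_gap(seed=0, m=3):
    """Return |phi(g.x, (a,b)) - phi(x, (a,b))|: acting on the input alone."""
    rng = np.random.default_rng(seed)
    x = rng.normal(size=m)
    a = rng.normal(size=m)
    b = rng.normal()
    L = rng.normal(size=(m, m)) + 3.0 * np.eye(m)
    t = rng.normal(size=m)
    return abs(feature(act_on_input(L, t, x), a, b) - feature(x, a, b))


if __name__ == "__main__":
    print("joint action gap:     ", joint_invariance_gap())
    print("input-only action gap:", input_only_gap())
```

The joint gap vanishes up to floating-point error because $(L^{-\top} a) \cdot (Lx + t) - b - (L^{-\top} a) \cdot t = a \cdot x - b$, whereas transforming the input alone generally changes the neuron's output.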
