NALGMLMay 18, 2018

Butterfly-Net: Optimal Function Representation Based on Convolutional Neural Networks

arXiv:1805.07451v423 citations
Originality Incremental advance
AI Analysis

It addresses the problem of high computational complexity in neural networks for scientific applications by providing a theoretically grounded, efficient method, though it is incremental as it builds on existing CNN frameworks.

The paper introduces Butterfly-Net, a low-complexity CNN with structured sparse connections, to approximate Fourier representations, showing exponential error decay with depth and improved performance over random initialization in numerical experiments.

Deep networks, especially convolutional neural networks (CNNs), have been successfully applied in various areas of machine learning as well as to challenging problems in other scientific and engineering fields. This paper introduces Butterfly-Net, a low-complexity CNN with structured and sparse cross-channel connections, together with a Butterfly initialization strategy for a family of networks. Theoretical analysis of the approximation power of Butterfly-Net to the Fourier representation of input data shows that the error decays exponentially as the depth increases. Combining Butterfly-Net with a fully connected neural network, a large class of problems are proved to be well approximated with network complexity depending on the effective frequency bandwidth instead of the input dimension. Regular CNN is covered as a special case in our analysis. Numerical experiments validate the analytical results on the approximation of Fourier kernels and energy functionals of Poisson's equations. Moreover, all experiments support that training from Butterfly initialization outperforms training from random initialization. Also, adding the remaining cross-channel connections, although significantly increase the parameter number, does not much improve the post-training accuracy and is more sensitive to data distribution.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes