CVLGNAJan 29, 2019

MgNet: A Unified Framework of Multigrid and Convolutional Neural Network

arXiv:1901.10415v277 citations
Originality Incremental advance
AI Analysis

This work provides a theoretical bridge between CNNs and PDE solvers, potentially aiding in understanding CNN operations, but it is incremental as it builds on existing methods with specific improvements.

The authors developed MgNet, a unified framework that connects convolutional neural networks (CNNs) for image classification with multigrid methods for solving PDEs, leading to modified CNN models with fewer weights and hyperparameters that achieved competitive or better performance on CIFAR-10 and CIFAR-100 datasets.

We develop a unified model, known as MgNet, that simultaneously recovers some convolutional neural networks (CNN) for image classification and multigrid (MG) methods for solving discretized partial differential equations (PDEs). This model is based on close connections that we have observed and uncovered between the CNN and MG methodologies. For example, pooling operation and feature extraction in CNN correspond directly to restriction operation and iterative smoothers in MG, respectively. As the solution space is often the dual of the data space in PDEs, the analogous concept of feature space and data space (which are dual to each other) is introduced in CNN. With such connections and new concept in the unified model, the function of various convolution operations and pooling used in CNN can be better understood. As a result, modified CNN models (with fewer weights and hyper parameters) are developed that exhibit competitive and sometimes better performance in comparison with existing CNN models when applied to both CIFAR-10 and CIFAR-100 data sets.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes