IVCVMar 18, 2024

PAON: A New Neuron Model using Padé Approximants

arXiv:2403.11791v14 citationsh-index: 62ICIP
Originality Incremental advance
AI Analysis

This work proposes a foundational neuron model that could enhance CNN performance across various applications, though it appears incremental as it builds on prior enhanced neuron models.

The paper tackles the limitation of linear neuron models in CNNs by introducing a new neuron model called Paons based on Padé approximants, which can replace neurons in any CNN and achieves better results in single-image super-resolution tasks.

Convolutional neural networks (CNN) are built upon the classical McCulloch-Pitts neuron model, which is essentially a linear model, where the nonlinearity is provided by a separate activation function. Several researchers have proposed enhanced neuron models, including quadratic neurons, generalized operational neurons, generative neurons, and super neurons, with stronger nonlinearity than that provided by the pointwise activation function. There has also been a proposal to use Pade approximation as a generalized activation function. In this paper, we introduce a brand new neuron model called Pade neurons (Paons), inspired by the Pade approximants, which is the best mathematical approximation of a transcendental function as a ratio of polynomials with different orders. We show that Paons are a super set of all other proposed neuron models. Hence, the basic neuron in any known CNN model can be replaced by Paons. In this paper, we extend the well-known ResNet to PadeNet (built by Paons) to demonstrate the concept. Our experiments on the single-image super-resolution task show that PadeNets can obtain better results than competing architectures.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes