CV IVJan 7

Padé Neurons for Efficient Neural Models

arXiv:2601.04005v1Has CodeIEEE Transactions on Image Processing

Originality Highly original

AI Analysis

This work addresses the problem of inefficient neural network architectures for researchers and practitioners by offering a more expressive neuron model that can reduce layer count while maintaining performance.

The paper tackles the limitation of linear neuron models by introducing Padé neurons (Paons), a novel non-linear neuron model that learns diverse non-linear functions, and demonstrates that neural models using Paons achieve better or equal performance with fewer layers in tasks like image super-resolution, compression, and classification.

Neural networks commonly employ the McCulloch-Pitts neuron model, which is a linear model followed by a point-wise non-linear activation. Various researchers have already advanced inherently non-linear neuron models, such as quadratic neurons, generalized operational neurons, generative neurons, and super neurons, which offer stronger non-linearity compared to point-wise activation functions. In this paper, we introduce a novel and better non-linear neuron model called Padé neurons (Paons), inspired by Padé approximants. Paons offer several advantages, such as diversity of non-linearity, since each Paon learns a different non-linear function of its inputs, and layer efficiency, since Paons provide stronger non-linearity in much fewer layers compared to piecewise linear approximation. Furthermore, Paons include all previously proposed neuron models as special cases, thus any neuron model in any network can be replaced by Paons. We note that there has been a proposal to employ the Padé approximation as a generalized point-wise activation function, which is fundamentally different from our model. To validate the efficacy of Paons, in our experiments, we replace classic neurons in some well-known neural image super-resolution, compression, and classification models based on the ResNet architecture with Paons. Our comprehensive experimental results and analyses demonstrate that neural models built by Paons provide better or equal performance than their classic counterparts with a smaller number of layers. The PyTorch implementation code for Paon is open-sourced at https://github.com/onur-keles/Paon.

View on arXiv PDF Code

Similar