CVAILGJun 1, 2021

Neural Network Structure Design based on N-Gauss Activation Function

arXiv:2106.07562v1
Originality Synthesis-oriented
AI Analysis

This is an incremental improvement for neural network design, potentially benefiting researchers and practitioners in deep learning by offering an alternative activation function.

The paper tackled the problem of designing neural network structures using a new N-Gauss activation function, and the result showed that N-Gauss enables deep convolutional neural networks to have hierarchical nonlinear mapping learning capabilities, with training performance on simple datasets equivalent to ReLU and Swish.

Recent work has shown that the activation function of the convolutional neural network can meet the Lipschitz condition, then the corresponding convolutional neural network structure can be constructed according to the scale of the data set, and the data set can be trained more deeply, more accurately and more effectively. In this article, we have accepted the experimental results and introduced the core block N-Gauss, N-Gauss, and Swish (Conv1, Conv2, FC1) neural network structure design to train MNIST, CIFAR10, and CIFAR100 respectively. Experiments show that N-Gauss gives full play to the main role of nonlinear modeling of activation functions, so that deep convolutional neural networks have hierarchical nonlinear mapping learning capabilities. At the same time, the training ability of N-Gauss on simple one-dimensional channel small data sets is equivalent to the performance of ReLU and Swish.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes