NE · AI · CV · LG · Sep 9, 2021

ErfAct and Pserf: Non-monotonic Smooth Trainable Activation Functions

arXiv:2109.04386v4 · 16 citations
Originality: Incremental advance
AI Analysis

This work addresses the need for activation functions that improve neural network accuracy. The contribution appears incremental, since it builds on existing activation function concepts.

The authors propose two novel non-monotonic, smooth, trainable activation functions, ErfAct and Pserf. Replacing ReLU with them improves top-1 accuracy on ShuffleNet V2 for CIFAR100 by 5.68% and 5.42%, respectively.

An activation function is a crucial component of a neural network that introduces non-linearity into the network. A network's state-of-the-art performance also depends on choosing the right activation function. We propose two novel non-monotonic smooth trainable activation functions, called ErfAct and Pserf. Experiments suggest that the proposed functions significantly improve network performance compared to widely used activations such as ReLU, Swish, and Mish. Replacing ReLU with ErfAct and Pserf yields 5.68% and 5.42% improvements in top-1 accuracy for the ShuffleNet V2 (2.0x) network on the CIFAR100 dataset, 2.11% and 1.96% improvements in top-1 accuracy for the same network on the CIFAR10 dataset, and 1.0% and 1.0% improvements in mean average precision (mAP) for the SSD300 model on the Pascal VOC dataset.
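The abstract names the two functions but does not state their definitions. As a rough illustration only, the sketch below assumes the forms x·erf(α·e^(βx)) for ErfAct and x·erf(γ·ln(1 + e^(δx))) for Pserf; these forms, the parameter names, and the default values are assumptions not confirmed by the text above, and should be checked against the paper itself. In a real network the parameters would be learned during training; here they are plain scalar arguments.

```python
import math

def erfact(x, alpha=0.75, beta=0.75):
    """Assumed ErfAct form: x * erf(alpha * exp(beta * x)).

    alpha and beta stand in for the trainable parameters; the
    defaults are illustrative placeholders, not confirmed values.
    """
    # Clamp the exponent so math.exp cannot overflow; erf has long
    # since saturated at 1 for arguments this large.
    z = min(beta * x, 50.0)
    return x * math.erf(alpha * math.exp(z))

def pserf(x, gamma=1.25, delta=0.85):
    """Assumed Pserf form: x * erf(gamma * softplus(delta * x)).

    gamma and delta stand in for the trainable parameters; the
    defaults are illustrative placeholders, not confirmed values.
    """
    z = delta * x
    # Numerically stable softplus: ln(1 + e^z).
    softplus = max(z, 0.0) + math.log1p(math.exp(-abs(z)))
    return x * math.erf(gamma * softplus)

if __name__ == "__main__":
    for x in (-10.0, -1.0, 0.0, 1.0, 5.0):
        print(f"x={x:6.1f}  erfact={erfact(x):9.5f}  pserf={pserf(x):9.5f}")
```

Under these assumed forms, both functions pass through zero at x = 0, dip below zero for moderately negative inputs, return toward zero for very negative inputs, and approach the identity for large positive inputs. That dip is what "non-monotonic" refers to, and it is the same qualitative shape as Swish and Mish.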

