LGCENEAug 10, 2023

Optimizing Performance of Feedforward and Convolutional Neural Networks through Dynamic Activation Functions

arXiv:2308.05724v218 citationsh-index: 5
Originality Incremental advance
AI Analysis

This addresses performance bottlenecks in neural networks for researchers and practitioners, though it appears incremental as it modifies existing activation functions rather than introducing a new paradigm.

The paper tackles the problem of optimizing neural network performance by introducing complex piece-wise linear activation functions as alternatives to ReLU, showing they work much better for both convolutional neural networks and multilayer perceptrons in PyTorch comparisons.

Deep learning training training algorithms are a huge success in recent years in many fields including speech, text,image video etc. Deeper and deeper layers are proposed with huge success with resnet structures having around 152 layers. Shallow convolution neural networks(CNN's) are still an active research, where some phenomena are still unexplained. Activation functions used in the network are of utmost importance, as they provide non linearity to the networks. Relu's are the most commonly used activation function.We show a complex piece-wise linear(PWL) activation in the hidden layer. We show that these PWL activations work much better than relu activations in our networks for convolution neural networks and multilayer perceptrons. Result comparison in PyTorch for shallow and deep CNNs are given to further strengthen our case.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes