LGAICVNESep 10, 2022

APTx: better activation function than MISH, SWISH, and ReLU's variants used in deep learning

arXiv:2209.06119v67 citationsHas Code
Originality Synthesis-oriented
AI Analysis

This work addresses the need for efficient activation functions in deep learning, though it appears incremental as it builds on existing functions like MISH.

The paper introduces APTx, a new activation function that reduces computational operations compared to MISH while maintaining similar performance, speeding up model training and lowering hardware requirements.

Activation Functions introduce non-linearity in the deep neural networks. This nonlinearity helps the neural networks learn faster and efficiently from the dataset. In deep learning, many activation functions are developed and used based on the type of problem statement. ReLU's variants, SWISH, and MISH are goto activation functions. MISH function is considered having similar or even better performance than SWISH, and much better than ReLU. In this paper, we propose an activation function named APTx which behaves similar to MISH, but requires lesser mathematical operations to compute. The lesser computational requirements of APTx does speed up the model training, and thus also reduces the hardware requirement for the deep learning model. Source code: https://github.com/mr-ravin/aptx_activation

Code Implementations2 repos
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes