LGMLSep 16, 2020

Activation Functions: Do They Represent A Trade-Off Between Modular Nature of Neural Networks And Task Performance

arXiv:2009.07793v1Has Code
Originality Synthesis-oriented
AI Analysis

This work addresses the design of neural network architectures for researchers and practitioners, but it appears incremental as it focuses on a specific component without presenting new results.

The paper investigates whether the ReLU activation function is optimal for achieving better modularity in neural networks, exploring the trade-off between modular structure and task performance.

Current research suggests that the key factors in designing neural network architectures involve choosing number of filters for every convolution layer, number of hidden neurons for every fully connected layer, dropout and pruning. The default activation function in most cases is the ReLU, as it has empirically shown faster training convergence. We explore whether ReLU is the best choice if one is aiming to desire better modularity structure within a neural network.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes