LGAIMLOct 14, 2018

Variational Neural Networks: Every Layer and Neuron Can Be Unique

arXiv:1810.06120v1
Originality Incremental advance
AI Analysis

This addresses the lack of guiding principles for activation function selection in neural networks, offering a novel optimization approach.

The paper tackles the problem of selecting activation functions in neural networks by introducing variational neural networks, where activation functions are represented as linear combinations of candidate functions and optimized via gradient descent, resulting in a method to derive optimal activations.

The choice of activation function can significantly influence the performance of neural networks. The lack of guiding principles for the selection of activation function is lamentable. We try to address this issue by introducing our variational neural networks, where the activation function is represented as a linear combination of possible candidate functions, and an optimal activation is obtained via minimization of a loss function using gradient descent method. The gradient formulae for the loss function with respect to these expansion coefficients are central for the implementation of gradient descent algorithm, and here we derive these gradient formulae.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes