LGMLOct 6, 2019

Auto-Rotating Perceptrons

arXiv:1910.02483v21 citations
Originality Incremental advance
AI Analysis

This addresses a known bottleneck in training deep neural networks with bounded activations, but appears incremental as it modifies neuron design without changing inference structure.

The paper tackles the vanishing gradient problem in deep multilayer perceptron networks by introducing auto-rotating perceptrons (ARP) that prevent saturation in activation functions, resulting in improved learning performance compared to classic perceptrons.

This paper proposes an improved design of the perceptron unit to mitigate the vanishing gradient problem. This nuisance appears when training deep multilayer perceptron networks with bounded activation functions. The new neuron design, named auto-rotating perceptron (ARP), has a mechanism to ensure that the node always operates in the dynamic region of the activation function, by avoiding saturation of the perceptron. The proposed method does not change the inference structure learned at each neuron. We test the effect of using ARP units in some network architectures which use the sigmoid activation function. The results support our hypothesis that neural networks with ARP units can achieve better learning performance than equivalent models with classic perceptrons.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes