LGCVOct 22, 2020

A ReLU Dense Layer to Improve the Performance of Neural Networks

arXiv:2010.13572v141 citationsHas Code
Originality Synthesis-oriented
AI Analysis

This is an incremental improvement for neural network practitioners seeking to enhance model performance with low complexity.

The authors tackled the problem of improving neural network performance by proposing ReDense, a ReLU dense layer added to trained networks, which achieved lower training loss while maintaining generalization error, as demonstrated on benchmark datasets.

We propose ReDense as a simple and low complexity way to improve the performance of trained neural networks. We use a combination of random weights and rectified linear unit (ReLU) activation function to add a ReLU dense (ReDense) layer to the trained neural network such that it can achieve a lower training loss. The lossless flow property (LFP) of ReLU is the key to achieve the lower training loss while keeping the generalization error small. ReDense does not suffer from vanishing gradient problem in the training due to having a shallow structure. We experimentally show that ReDense can improve the training and testing performance of various neural network architectures with different optimization loss and activation functions. Finally, we test ReDense on some of the state-of-the-art architectures and show the performance improvement on benchmark datasets.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes