LG AIApr 26, 2023

Feed-Forward Optimization With Delayed Feedback for Neural Network Training

Katharina Flügel, Daniel Coquelin, Marie Weiel, Charlotte Debus, Achim Streit, Markus Götz

arXiv:2304.13372v23.83 citationsh-index: 7Has Code

Originality Incremental advance

AI Analysis

This work addresses the problem of developing more biologically plausible and efficient training algorithms for neural networks, though it is incremental compared to existing alternatives.

The paper tackled the biological implausibility and inefficiency of backpropagation in neural network training by proposing Feed-Forward with delayed Feedback (F^3), which approximates gradients using fixed random feedback and delayed error information, resulting in narrowing the performance gap to backpropagation by up to 56% for classification and 96% for regression.

Backpropagation has long been criticized for being biologically implausible due to its reliance on concepts that are not viable in natural learning processes. Two core issues are the weight transport and update locking problems caused by the forward-backward dependencies, which limit biological plausibility, computational efficiency, and parallelization. Although several alternatives have been proposed to increase biological plausibility, they often come at the cost of reduced predictive performance. This paper proposes an alternative approach to training feed-forward neural networks addressing these issues by using approximate gradient information. We introduce Feed-Forward with delayed Feedback (F$^3$), which approximates gradients using fixed random feedback paths and delayed error information from the previous epoch to balance biological plausibility with predictive performance. We evaluate F$^3$ across multiple tasks and architectures, including both fully-connected and Transformer networks. Our results demonstrate that, compared to similarly plausible approaches, F$^3$ significantly improves predictive performance, narrowing the gap to backpropagation by up to 56% for classification and 96% for regression. This work is a step towards more biologically plausible learning algorithms while opening up new avenues for energy-efficient and parallelizable neural network training.

View on arXiv PDF Code

Similar