NEMar 17, 2020

iNALU: Improved Neural Arithmetic Logic Unit

arXiv:2003.07629v118 citations
AI Analysis

This work addresses a specific problem in neural network design for researchers and practitioners, offering incremental improvements over the original NALU model.

The paper tackled the limitations of the Neural Arithmetic Logic Unit (NALU), which struggles with negative inputs and training instability, by proposing an improved architecture that enhances arithmetic precision and convergence in various tasks.

Neural networks have to capture mathematical relationships in order to learn various tasks. They approximate these relations implicitly and therefore often do not generalize well. The recently proposed Neural Arithmetic Logic Unit (NALU) is a novel neural architecture which is able to explicitly represent the mathematical relationships by the units of the network to learn operations such as summation, subtraction or multiplication. Although NALUs have been shown to perform well on various downstream tasks, an in-depth analysis reveals practical shortcomings by design, such as the inability to multiply or divide negative input values or training stability issues for deeper networks. We address these issues and propose an improved model architecture. We evaluate our model empirically in various settings from learning basic arithmetic operations to more complex functions. Our experiments indicate that our model solves stability issues and outperforms the original NALU model in means of arithmetic precision and convergence.

Code Implementations2 repos
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes