LG NE MLJun 28, 2019

ARMIN: Towards a More Efficient and Light-weight Recurrent Memory Network

Zhangheng Li, Jia-Xing Zhong, Jingjia Huang, Tao Zhang, Thomas Li, Ge Li

arXiv:1906.12087v11.82 citationsh-index: 50Has Code

Originality Incremental advance

AI Analysis

This work addresses computational bottlenecks in sequential processing tasks for researchers and practitioners using memory-augmented neural networks, though it appears incremental as it builds on existing MANN frameworks.

The paper tackles the inefficiency and computational overhead of memory-augmented neural networks (MANNs) by introducing ARMIN, which uses a simplified auto-addressing mechanism and a novel RNN cell, resulting in a more light-weight and efficient model with much lower computational overhead than vanilla LSTM while maintaining similar performance.

In recent years, memory-augmented neural networks(MANNs) have shown promising power to enhance the memory ability of neural networks for sequential processing tasks. However, previous MANNs suffer from complex memory addressing mechanism, making them relatively hard to train and causing computational overheads. Moreover, many of them reuse the classical RNN structure such as LSTM for memory processing, causing inefficient exploitations of memory information. In this paper, we introduce a novel MANN, the Auto-addressing and Recurrent Memory Integrating Network (ARMIN) to address these issues. The ARMIN only utilizes hidden state ht for automatic memory addressing, and uses a novel RNN cell for refined integration of memory information. Empirical results on a variety of experiments demonstrate that the ARMIN is more light-weight and efficient compared to existing memory networks. Moreover, we demonstrate that the ARMIN can achieve much lower computational overhead than vanilla LSTM while keeping similar performances. Codes are available on github.com/zoharli/armin.

View on arXiv PDF Code

Similar