LGCCMLSep 28, 2018

Learning Recurrent Binary/Ternary Weights

arXiv:1809.11086v229 citations
Originality Incremental advance
AI Analysis

This addresses the problem of embedding RNNs on mobile devices with limited resources, though it is incremental as it builds on existing quantization techniques.

The paper tackles the complexity and memory intensity of recurrent neural networks (RNNs) by introducing a method to learn binary and ternary weights during training, achieving competitive accuracy on tasks like sequence classification and language modeling while enabling up to 12x memory saving and 10x inference speedup on hardware.

Recurrent neural networks (RNNs) have shown excellent performance in processing sequence data. However, they are both complex and memory intensive due to their recursive nature. These limitations make RNNs difficult to embed on mobile devices requiring real-time processes with limited hardware resources. To address the above issues, we introduce a method that can learn binary and ternary weights during the training phase to facilitate hardware implementations of RNNs. As a result, using this approach replaces all multiply-accumulate operations by simple accumulations, bringing significant benefits to custom hardware in terms of silicon area and power consumption. On the software side, we evaluate the performance (in terms of accuracy) of our method using long short-term memories (LSTMs) on various sequential models including sequence classification and language modeling. We demonstrate that our method achieves competitive results on the aforementioned tasks while using binary/ternary weights during the runtime. On the hardware side, we present custom hardware for accelerating the recurrent computations of LSTMs with binary/ternary weights. Ultimately, we show that LSTMs with binary/ternary weights can achieve up to 12x memory saving and 10x inference speedup compared to the full-precision implementation on an ASIC platform.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes