LGNEDec 4, 2015

Fixed-Point Performance Analysis of Recurrent Neural Networks

arXiv:1512.01322v366 citations
Originality Synthesis-oriented
AI Analysis

This work addresses hardware efficiency for RNN implementations, but it is incremental as it applies known quantization methods to RNNs.

The paper tackles the hardware complexity of recurrent neural networks by analyzing fixed-point performance using retrain-based quantization, achieving minimized weight capacity without performance loss in language modeling and phoneme recognition tasks.

Recurrent neural networks have shown excellent performance in many applications, however they require increased complexity in hardware or software based implementations. The hardware complexity can be much lowered by minimizing the word-length of weights and signals. This work analyzes the fixed-point performance of recurrent neural networks using a retrain based quantization method. The quantization sensitivity of each layer in RNNs is studied, and the overall fixed-point optimization results minimizing the capacity of weights while not sacrificing the performance are presented. A language model and a phoneme recognition examples are used.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes