CL AIOct 30, 2017

Understanding Hidden Memories of Recurrent Neural Networks

Yao Ming, Shaozu Cao, Ruixiang Zhang, Zhen Li, Yuanzhe Chen, Yangqiu Song, Huamin Qu

arXiv:1710.10777v115.5223 citations

Originality Incremental advance

AI Analysis

This addresses the lack of interpretability in RNNs for NLP researchers, but it is incremental as it builds on existing visualization techniques.

The paper tackled the problem of understanding the mechanisms behind recurrent neural networks (RNNs) in NLP by developing a visual analytics method to explain hidden state units and their responses to input texts, with effectiveness demonstrated through case studies and expert reviews.

Recurrent neural networks (RNNs) have been successfully applied to various natural language processing (NLP) tasks and achieved better results than conventional methods. However, the lack of understanding of the mechanisms behind their effectiveness limits further improvements on their architectures. In this paper, we present a visual analytics method for understanding and comparing RNN models for NLP tasks. We propose a technique to explain the function of individual hidden state units based on their expected response to input texts. We then co-cluster hidden state units and words based on the expected response and visualize co-clustering results as memory chips and word clouds to provide more structured knowledge on RNNs' hidden states. We also propose a glyph-based sequence visualization based on aggregate information to analyze the behavior of an RNN's hidden state at the sentence-level. The usability and effectiveness of our method are demonstrated through case studies and reviews from domain experts.

View on arXiv PDF

Similar