CLAIOct 30, 2017

Understanding Hidden Memories of Recurrent Neural Networks

arXiv:1710.10777v1223 citations
Originality Incremental advance
AI Analysis

This addresses the lack of interpretability in RNNs for NLP researchers, but it is incremental as it builds on existing visualization techniques.

The paper tackled the problem of understanding the mechanisms behind recurrent neural networks (RNNs) in NLP by developing a visual analytics method to explain hidden state units and their responses to input texts, with effectiveness demonstrated through case studies and expert reviews.

Recurrent neural networks (RNNs) have been successfully applied to various natural language processing (NLP) tasks and achieved better results than conventional methods. However, the lack of understanding of the mechanisms behind their effectiveness limits further improvements on their architectures. In this paper, we present a visual analytics method for understanding and comparing RNN models for NLP tasks. We propose a technique to explain the function of individual hidden state units based on their expected response to input texts. We then co-cluster hidden state units and words based on the expected response and visualize co-clustering results as memory chips and word clouds to provide more structured knowledge on RNNs' hidden states. We also propose a glyph-based sequence visualization based on aggregate information to analyze the behavior of an RNN's hidden state at the sentence-level. The usability and effectiveness of our method are demonstrated through case studies and reviews from domain experts.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes