CL LG NESep 13, 2019

SANVis: Visual Analytics for Understanding Self-Attention Networks

Cheonbok Park, Inyoup Na, Yongjang Jo, Sungbok Shin, Jaehyo Yoo, Bum Chul Kwon, Jian Zhao, Hyungjong Noh, Yeonsoo Lee, Jaegul Choo

arXiv:1909.09595v13.044 citations

Originality Synthesis-oriented

AI Analysis

This addresses the problem of model interpretability for researchers and practitioners using attention-based networks, though it is incremental as it builds on existing visualization tools for neural networks.

The paper tackles the challenge of understanding complex multi-head self-attention networks by presenting SANVis, a visual analytics system that helps users interpret model behaviors and characteristics, demonstrated using the Transformer model in machine translation tasks.

Attention networks, a deep neural network architecture inspired by humans' attention mechanism, have seen significant success in image captioning, machine translation, and many other applications. Recently, they have been further evolved into an advanced approach called multi-head self-attention networks, which can encode a set of input vectors, e.g., word vectors in a sentence, into another set of vectors. Such encoding aims at simultaneously capturing diverse syntactic and semantic features within a set, each of which corresponds to a particular attention head, forming altogether multi-head attention. Meanwhile, the increased model complexity prevents users from easily understanding and manipulating the inner workings of models. To tackle the challenges, we present a visual analytics system called SANVis, which helps users understand the behaviors and the characteristics of multi-head self-attention networks. Using a state-of-the-art self-attention model called Transformer, we demonstrate usage scenarios of SANVis in machine translation tasks. Our system is available at http://short.sanvis.org

View on arXiv PDF

Similar