HCCLLGAug 17, 2022

ShortcutLens: A Visual Analytics Approach for Exploring Shortcuts in Natural Language Understanding Dataset

arXiv:2208.08010v112 citationsh-index: 58
Originality Incremental advance
AI Analysis

This addresses the issue for NLU researchers and dataset creators by providing a tool to systematically analyze dataset biases, though it is incremental as it builds on existing visual analytics approaches for data exploration.

The paper tackles the problem of shortcuts (unwanted biases) in natural language understanding benchmark datasets, which can undermine model evaluation, by developing ShortcutLens, a visual analytics system that helps experts explore these shortcuts through multi-level views, with case studies and expert interviews showing it supports better understanding and creation of challenging datasets.

Benchmark datasets play an important role in evaluating Natural Language Understanding (NLU) models. However, shortcuts -- unwanted biases in the benchmark datasets -- can damage the effectiveness of benchmark datasets in revealing models' real capabilities. Since shortcuts vary in coverage, productivity, and semantic meaning, it is challenging for NLU experts to systematically understand and avoid them when creating benchmark datasets. In this paper, we develop a visual analytics system, ShortcutLens, to help NLU experts explore shortcuts in NLU benchmark datasets. The system allows users to conduct multi-level exploration of shortcuts. Specifically, Statistics View helps users grasp the statistics such as coverage and productivity of shortcuts in the benchmark dataset. Template View employs hierarchical and interpretable templates to summarize different types of shortcuts. Instance View allows users to check the corresponding instances covered by the shortcuts. We conduct case studies and expert interviews to evaluate the effectiveness and usability of the system. The results demonstrate that ShortcutLens supports users in gaining a better understanding of benchmark dataset issues through shortcuts, inspiring them to create challenging and pertinent benchmark datasets.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes