CLDec 19, 2022

A Comparative Study on Textual Saliency of Styles from Eye Tracking, Annotations, and Language Models

arXiv:2212.09873v216.7132 citationsh-index: 22Has Code

Originality Synthesis-oriented

AI Analysis

This work addresses the integration of human language processing data into NLP for evaluating model cognitive plausibility, but it is incremental as it compares existing methods without introducing a new paradigm.

The paper tackled the problem of understanding how eye-tracking data relates to human annotations and model-based metrics for stylistic text saliency, finding that it intersects with both and can bridge human and machine perspectives.

There is growing interest in incorporating eye-tracking data and other implicit measures of human language processing into natural language processing (NLP) pipelines. The data from human language processing contain unique insight into human linguistic understanding that could be exploited by language models. However, many unanswered questions remain about the nature of this data and how it can best be utilized in downstream NLP tasks. In this paper, we present eyeStyliency, an eye-tracking dataset for human processing of stylistic text (e.g., politeness). We develop a variety of methods to derive style saliency scores over text using the collected eye dataset. We further investigate how this saliency data compares to both human annotation methods and model-based interpretability metrics. We find that while eye-tracking data is unique, it also intersects with both human annotations and model-based importance scores, providing a possible bridge between human- and machine-based perspectives. We propose utilizing this type of data to evaluate the cognitive plausibility of models that interpret style. Our eye-tracking data and processing code are publicly available.

View on arXiv PDF Code

Similar