CLLGMar 29, 2022

Visualizing the Relationship Between Encoded Linguistic Information and Task Performance

arXiv:2203.15860v1639 citations
Originality Incremental advance
AI Analysis

This work addresses the incremental problem of understanding performance trade-offs in NLP models for researchers, showing that optimizing linguistic encoding is not straightforward.

The study investigates how encoded linguistic information affects task performance in NLP models by using Pareto optimality to analyze the trade-off, finding that while some syntactic information helps, more does not always improve performance due to model architecture constraints.

Probing is popular to analyze whether linguistic information can be captured by a well-trained deep neural model, but it is hard to answer how the change of the encoded linguistic information will affect task performance. To this end, we study the dynamic relationship between the encoded linguistic information and task performance from the viewpoint of Pareto Optimality. Its key idea is to obtain a set of models which are Pareto-optimal in terms of both objectives. From this viewpoint, we propose a method to optimize the Pareto-optimal models by formalizing it as a multi-objective optimization problem. We conduct experiments on two popular NLP tasks, i.e., machine translation and language modeling, and investigate the relationship between several kinds of linguistic information and task performances. Experimental results demonstrate that the proposed method is better than a baseline method. Our empirical findings suggest that some syntactic information is helpful for NLP tasks whereas encoding more syntactic information does not necessarily lead to better performance, because the model architecture is also an important factor.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes