CL LGMar 29, 2022

Visualizing the Relationship Between Encoded Linguistic Information and Task Performance

Jiannan Xiang, Huayang Li, Defu Lian, Guoping Huang, Taro Watanabe, Lemao Liu

arXiv:2203.15860v131.9639 citationsh-index: 43

Originality Incremental advance

AI Analysis

This work addresses the incremental problem of understanding performance trade-offs in NLP models for researchers, showing that optimizing linguistic encoding is not straightforward.

The study investigates how encoded linguistic information affects task performance in NLP models by using Pareto optimality to analyze the trade-off, finding that while some syntactic information helps, more does not always improve performance due to model architecture constraints.

Probing is popular to analyze whether linguistic information can be captured by a well-trained deep neural model, but it is hard to answer how the change of the encoded linguistic information will affect task performance. To this end, we study the dynamic relationship between the encoded linguistic information and task performance from the viewpoint of Pareto Optimality. Its key idea is to obtain a set of models which are Pareto-optimal in terms of both objectives. From this viewpoint, we propose a method to optimize the Pareto-optimal models by formalizing it as a multi-objective optimization problem. We conduct experiments on two popular NLP tasks, i.e., machine translation and language modeling, and investigate the relationship between several kinds of linguistic information and task performances. Experimental results demonstrate that the proposed method is better than a baseline method. Our empirical findings suggest that some syntactic information is helpful for NLP tasks whereas encoding more syntactic information does not necessarily lead to better performance, because the model architecture is also an important factor.

View on arXiv PDF

Similar