CLFeb 27, 2024

Linguistic Knowledge Can Enhance Encoder-Decoder Models (If You Let It)

arXiv:2402.17608v181 citationsh-index: 30LREC
Originality Incremental advance
AI Analysis

This work addresses enhancing natural language processing models for tasks like sentence complexity prediction, but it is incremental as it builds on existing pre-trained models with linguistic fine-tuning.

The paper tackled improving encoder-decoder models by augmenting them with linguistic knowledge through intermediate fine-tuning, finding that this approach generally boosts performance in predicting sentence-level complexity, particularly for smaller models and in data-limited scenarios across Italian and English datasets.

In this paper, we explore the impact of augmenting pre-trained Encoder-Decoder models, specifically T5, with linguistic knowledge for the prediction of a target task. In particular, we investigate whether fine-tuning a T5 model on an intermediate task that predicts structural linguistic properties of sentences modifies its performance in the target task of predicting sentence-level complexity. Our study encompasses diverse experiments conducted on Italian and English datasets, employing both monolingual and multilingual T5 models at various sizes. Results obtained for both languages and in cross-lingual configurations show that linguistically motivated intermediate fine-tuning has generally a positive impact on target task performance, especially when applied to smaller models and in scenarios with limited data availability.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes