LGCLMLMay 28, 2021

Weighted Training for Cross-Task Learning

arXiv:2105.14095v230 citations
Originality Incremental advance
AI Analysis

This work addresses cross-task learning challenges in NLP, offering a method with theoretical guarantees and practical efficiency, though it is incremental as it builds on existing weighted training and representation-based approaches.

The paper tackles the problem of cross-task learning by introducing Target-Aware Weighted Training (TAWT), a weighted training algorithm that minimizes representation-based task distances, and shows its effectiveness through experiments on four NLP sequence tagging tasks with BERT, achieving competitive results.

In this paper, we introduce Target-Aware Weighted Training (TAWT), a weighted training algorithm for cross-task learning based on minimizing a representation-based task distance between the source and target tasks. We show that TAWT is easy to implement, is computationally efficient, requires little hyperparameter tuning, and enjoys non-asymptotic learning-theoretic guarantees. The effectiveness of TAWT is corroborated through extensive experiments with BERT on four sequence tagging tasks in natural language processing (NLP), including part-of-speech (PoS) tagging, chunking, predicate detection, and named entity recognition (NER). As a byproduct, the proposed representation-based task distance allows one to reason in a theoretically principled way about several critical aspects of cross-task learning, such as the choice of the source data and the impact of fine-tuning.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes