CLJul 1, 2024

HyperLoader: Integrating Hypernetwork-Based LoRA and Adapter Layers into Multi-Task Transformers for Sequence Labelling

arXiv:2407.01411v37 citationsh-index: 39
Originality Incremental advance
AI Analysis

This is an incremental improvement for multi-task learning in NLP, reducing task interference while combining parameter-efficient methods.

The paper tackles the problem of multi-task sequence labeling by integrating hypernetwork-based LoRA and adapter layers into transformers, achieving the best average performance across tasks in both high-resource and low-resource scenarios.

We present HyperLoader, a simple approach that combines different parameter-efficient fine-tuning methods in a multi-task setting. To achieve this goal, our model uses a hypernetwork to generate the weights of these modules based on the task, the transformer layer, and its position within this layer. Our method combines the benefits of multi-task learning by capturing the structure of all tasks while reducing the task interference problem by encapsulating the task-specific knowledge in the generated weights and the benefits of combining different parameter-efficient methods to outperform full-fine tuning. We provide empirical evidence that HyperLoader outperforms previous approaches in most datasets and obtains the best average performance across tasks in high-resource and low-resource scenarios.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes