CLAILGOct 14, 2021

UniPELT: A Unified Framework for Parameter-Efficient Language Model Tuning

arXiv:2110.07577v3677 citations
Originality Incremental advance
AI Analysis

This addresses the challenge of model selection for practitioners using PELT methods, especially with limited data, though it is incremental as it builds on existing PELT techniques.

The paper tackles the problem of selecting the best parameter-efficient language model tuning (PELT) method for specific tasks by proposing UniPELT, a unified framework that incorporates multiple PELT methods and learns to activate the most suitable ones via a gating mechanism, achieving 1-4% gains over the best individual PELT method on the GLUE benchmark and outperforming fine-tuning in some setups.

Recent parameter-efficient language model tuning (PELT) methods manage to match the performance of fine-tuning with much fewer trainable parameters and perform especially well when training data is limited. However, different PELT methods may perform rather differently on the same task, making it nontrivial to select the most appropriate method for a specific task, especially considering the fast-growing number of new PELT methods and tasks. In light of model diversity and the difficulty of model selection, we propose a unified framework, UniPELT, which incorporates different PELT methods as submodules and learns to activate the ones that best suit the current data or task setup via gating mechanism. On the GLUE benchmark, UniPELT consistently achieves 1~4% gains compared to the best individual PELT method that it incorporates and even outperforms fine-tuning under different setups. Moreover, UniPELT generally surpasses the upper bound that takes the best performance of all its submodules used individually on each task, indicating that a mixture of multiple PELT methods may be inherently more effective than single methods.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes