CLOct 31, 2023

Improving Prompt Tuning with Learned Prompting Layers

arXiv:2310.20127v13 citationsh-index: 4
Originality Incremental advance
AI Analysis

This work addresses a bottleneck in prompt tuning for adapting pretrained models to downstream tasks, offering an incremental improvement over existing methods.

The paper tackles the problem of suboptimal manual selection of prompt layers in prompt tuning by proposing Selective Prompt Tuning (SPT), which learns to select proper prompt layers using learnable probabilistic gates, and achieves better performance than previous state-of-the-art PETuning baselines on ten benchmark datasets in full-data and few-shot scenarios.

Prompt tuning prepends a soft prompt to the input embeddings or hidden states and only optimizes the prompt to adapt pretrained models (PTMs) to downstream tasks. The previous work manually selects prompt layers which are far from optimal and failed to exploit the potential of prompt tuning. In this work, we propose a novel framework, \underline{S}elective \underline{P}rompt \underline{T}uning (SPT), that learns to select the proper prompt layers by inserting a prompt controlled by a learnable probabilistic gate at each intermediate layer. We further propose a novel bi-level optimization framework, SPT-DARTS, that can better optimize the learnable gates and improve the final prompt tuning performances of the learned prompt layer settings. We conduct extensive experiments with ten benchmark datasets under the full-data and few-shot scenarios. The results demonstrate that our SPT framework can perform better than the previous state-of-the-art PETuning baselines with comparable or fewer tunable parameters.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes