LGMar 5, 2024

Controllable Prompt Tuning For Balancing Group Distributional Robustness

arXiv:2403.02695v212 citationsh-index: 8ICML
AI Analysis

This work addresses performance degradation in models across different groups under distribution shifts, offering a computationally efficient solution that balances group performance without severe trade-offs.

The paper tackles the problem of models performing poorly on certain groups under distribution shifts by introducing Controllable Prompt Tuning (CPT), which achieves state-of-the-art results on spurious correlation benchmarks across various architectures and data types while using only 0.4% tunable parameters.

Models trained on data composed of different groups or domains can suffer from severe performance degradation under distribution shifts. While recent methods have largely focused on optimizing the worst-group objective, this often comes at the expense of good performance on other groups. To address this problem, we introduce an optimization scheme to achieve good performance across groups and find a good solution for all without severely sacrificing performance on any of them. However, directly applying such optimization involves updating the parameters of the entire network, making it both computationally expensive and challenging. Thus, we introduce Controllable Prompt Tuning (CPT), which couples our approach with prompt-tuning techniques. On spurious correlation benchmarks, our procedures achieve state-of-the-art results across both transformer and non-transformer architectures, as well as unimodal and multimodal data, while requiring only 0.4% tunable parameters.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes