CLJul 14, 2023

Sensi-BERT: Towards Sensitivity Driven Fine-Tuning for Parameter-Efficient BERT

Souvik Kundu, Sharath Nittur Sridhar, Maciej Szankin, Sairam Sundaresan

arXiv:2307.11764v21.32 citationsh-index: 20

Originality Incremental advance

AI Analysis

This addresses the challenge of deploying large language models on edge devices, offering a parameter-efficient fine-tuning solution that is incremental over existing methods.

The paper tackles the problem of fine-tuning large pre-trained BERT models for resource-constrained edge devices by proposing Sensi-BERT, a sensitivity-driven method that trims parameters during fine-tuning based on a budget, resulting in better performance at similar or smaller parameter budgets across tasks like MNLI, QQP, QNLI, SST-2, and SQuAD.

Large pre-trained language models have recently gained significant traction due to their improved performance on various down-stream tasks like text classification and question answering, requiring only few epochs of fine-tuning. However, their large model sizes often prohibit their applications on resource-constrained edge devices. Existing solutions of yielding parameter-efficient BERT models largely rely on compute-exhaustive training and fine-tuning. Moreover, they often rely on additional compute heavy models to mitigate the performance gap. In this paper, we present Sensi-BERT, a sensitivity driven efficient fine-tuning of BERT models that can take an off-the-shelf pre-trained BERT model and yield highly parameter-efficient models for downstream tasks. In particular, we perform sensitivity analysis to rank each individual parameter tensor, that then is used to trim them accordingly during fine-tuning for a given parameter or FLOPs budget. Our experiments show the efficacy of Sensi-BERT across different downstream tasks including MNLI, QQP, QNLI, SST-2 and SQuAD, showing better performance at similar or smaller parameter budget compared to various alternatives.

View on arXiv PDF

Similar