CL AI LGNov 14, 2023

Low-Rank Adaptation for Multilingual Summarization: An Empirical Study

Chenxi Whitehouse, Fantine Huot, Jasmijn Bastings, Mostafa Dehghani, Chu-Cheng Lin, Mirella Lapata

DeepMind

arXiv:2311.08572v211.437 citationsh-index: 86

Originality Incremental advance

AI Analysis

This addresses memory efficiency for multilingual summarization, an incremental improvement in parameter-efficient fine-tuning.

The study tackled the challenge of fine-tuning large language models for multilingual summarization by investigating Low-Rank Adaptation (LoRA), finding it competitive with full fine-tuning in high-data settings and excelling in low-data scenarios and cross-lingual transfer.

Although the advancements of pre-trained Large Language Models have significantly accelerated recent progress in NLP, their ever-increasing size poses significant challenges for conventional fine-tuning, especially in memory-intensive tasks. We investigate the potential of Parameter-Efficient Fine-Tuning, focusing on Low-Rank Adaptation (LoRA), in the domain of multilingual summarization, a task that is both challenging (due to typically long inputs), and relatively unexplored. We conduct an extensive study across different data availability scenarios, including high- and low-data settings, and cross-lingual transfer, leveraging models of different sizes. Our findings reveal that LoRA is competitive with full fine-tuning when trained with high quantities of data, and excels in low-data scenarios and cross-lingual transfer. We also study different strategies for few-shot cross-lingual transfer, finding that continued LoRA tuning outperforms full fine-tuning and the dynamic composition of language-specific LoRA modules.

View on arXiv PDF

Similar