CLJun 7, 2024

Low-Resource Cross-Lingual Summarization through Few-Shot Learning with Large Language Models

arXiv:2406.04630v114.428 citationsHas Code

Originality Incremental advance

AI Analysis

This addresses the challenge of generating summaries across languages with limited parallel data, which is incremental as it builds on existing zero-shot XLS methods by exploring few-shot capabilities.

The paper tackled the problem of cross-lingual summarization (XLS) for low-resource languages by investigating few-shot learning with large language models (LLMs), finding that it significantly improves performance for GPT-3.5 and GPT-4, while Mistral-7B-Instruct-v0.2 struggles to adapt effectively.

Cross-lingual summarization (XLS) aims to generate a summary in a target language different from the source language document. While large language models (LLMs) have shown promising zero-shot XLS performance, their few-shot capabilities on this task remain unexplored, especially for low-resource languages with limited parallel data. In this paper, we investigate the few-shot XLS performance of various models, including Mistral-7B-Instruct-v0.2, GPT-3.5, and GPT-4. Our experiments demonstrate that few-shot learning significantly improves the XLS performance of LLMs, particularly GPT-3.5 and GPT-4, in low-resource settings. However, the open-source model Mistral-7B-Instruct-v0.2 struggles to adapt effectively to the XLS task with limited examples. Our findings highlight the potential of few-shot learning for improving XLS performance and the need for further research in designing LLM architectures and pre-training objectives tailored for this task. We provide a future work direction to explore more effective few-shot learning strategies and to investigate the transfer learning capabilities of LLMs for cross-lingual summarization.

View on arXiv PDF

Similar