CLAIAug 25, 2025

AMELIA: A Family of Multi-task End-to-end Language Models for Argumentation

arXiv:2508.17926v11 citationsArgument & Computation
Originality Synthesis-oriented
AI Analysis

This work addresses the need for efficient and effective multi-task models in argument mining, an incremental improvement by applying existing methods to a new unified dataset.

The paper tackled the problem of performing multiple argument mining tasks with a single large language model by creating a unified multi-task dataset from 19 existing datasets and exploring training strategies like fine-tuning and model merging. The results showed that task-specific fine-tuning significantly improved performance across all tasks, multi-task fine-tuning maintained strong performance without degradation, and model merging offered competitive performance while reducing computational costs.

Argument mining is a subfield of argumentation that aims to automatically extract argumentative structures and their relations from natural language texts. This paper investigates how a single large language model can be leveraged to perform one or several argument mining tasks. Our contributions are two-fold. First, we construct a multi-task dataset by surveying and converting 19 well-known argument mining datasets from the literature into a unified format. Second, we explore various training strategies using Meta AI's Llama-3.1-8B-Instruct model: (1) fine-tuning on individual tasks, (2) fine-tuning jointly on multiple tasks, and (3) merging models fine-tuned separately on individual tasks. Our experiments show that task-specific fine-tuning significantly improves individual performance across all tasks. Moreover, multi-task fine-tuning maintains strong performance without degradation, suggesting effective transfer learning across related tasks. Finally, we demonstrate that model merging offers a viable compromise: it yields competitive performance while mitigating the computational costs associated with full multi-task fine-tuning.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes