LGAIMAOct 3, 2025

The Argument is the Explanation: Structured Argumentation for Trust in Agents

arXiv:2510.03442v1h-index: 11
Originality Incremental advance
AI Analysis

This addresses the need for trustworthy explanations in AI systems, particularly for stakeholders requiring verifiable reasoning, though it appears incremental in applying existing argumentation methods to new domains.

The paper tackles the problem of AI explainability by proposing structured argumentation to provide verifiable reasoning chains instead of mechanistic transparency, achieving state-of-the-art results of 94.44 macro F1 on AAEC (5.7 points above prior work) and 0.81 macro F1 on Argumentative MicroTexts relation classification.

Humans are black boxes -- we cannot observe their neural processes, yet society functions by evaluating verifiable arguments. AI explainability should follow this principle: stakeholders need verifiable reasoning chains, not mechanistic transparency. We propose using structured argumentation to provide a level of explanation and verification neither interpretability nor LLM-generated explanation is able to offer. Our pipeline achieves state-of-the-art 94.44 macro F1 on the AAEC published train/test split (5.7 points above prior work) and $0.81$ macro F1, $\sim$0.07 above previous published results with comparable data setups, for Argumentative MicroTexts relation classification, converting LLM text into argument graphs and enabling verification at each inferential step. We demonstrate this idea on multi-agent risk assessment using the Structured What-If Technique, where specialized agents collaborate transparently to carry out risk assessment otherwise achieved by humans alone. Using Bipolar Assumption-Based Argumentation, we capture support/attack relationships, thereby enabling automatic hallucination detection via fact nodes attacking arguments. We also provide a verification mechanism that enables iterative refinement through test-time feedback without retraining. For easy deployment, we provide a Docker container for the fine-tuned AMT model, and the rest of the code with the Bipolar ABA Python package on GitHub.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes