CLAug 14, 2025

Towards Reliable Multi-Agent Systems for Marketing Applications via Reflection, Memory, and Planning

arXiv:2508.11120v21 citations
Originality Incremental advance
AI Analysis

This work addresses reliability issues in multi-agent systems for marketing applications, offering incremental improvements for industry deployment.

The paper tackled the problem of unreliable AI agents in real-world marketing tasks by introducing the RAMP framework, which improved accuracy by 28 percentage points on evaluation queries and increased recall by roughly 20 percentage points with iterative verification.

Recent advances in large language models (LLMs) enabled the development of AI agents that can plan and interact with tools to complete complex tasks. However, literature on their reliability in real-world applications remains limited. In this paper, we introduce a multi-agent framework for a marketing task: audience curation. To solve this, we introduce a framework called RAMP that iteratively plans, calls tools, verifies the output, and generates suggestions to improve the quality of the audience generated. Additionally, we equip the model with a long-term memory store, which is a knowledge base of client-specific facts and past queries. Overall, we demonstrate the use of LLM planning and memory, which increases accuracy by 28 percentage points on a set of 88 evaluation queries. Moreover, we show the impact of iterative verification and reflection on more ambiguous queries, showing progressively better recall (roughly +20 percentage points) with more verify/reflect iterations on a smaller challenge set, and higher user satisfaction. Our results provide practical insights for deploying reliable LLM-based systems in dynamic, industry-facing environments.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes