CL AIDec 18, 2024

Meta-Reflection: A Feedback-Free Reflection Learning Framework

Yaoke Wang, Yun Zhu, Xintong Bao, Wenqiao Zhang, Suyang Dai, Kehan Chen, Wenqiang Li, Gang Huang, Siliang Tang, Yueting Zhuang

arXiv:2412.13781v14.83 citationsh-index: 62

Originality Incremental advance

AI Analysis

This addresses the practical limitations of reflection methods for LLMs in real-world applications like e-commerce, though it is incremental as it builds on existing reflection concepts.

The paper tackles the problem of undesirable behaviors in large language models, such as hallucinations, by proposing Meta-Reflection, a feedback-free reflection mechanism that requires only a single inference pass, achieving effectiveness and efficiency on public datasets and a new e-commerce benchmark.

Despite the remarkable capabilities of large language models (LLMs) in natural language understanding and reasoning, they often display undesirable behaviors, such as generating hallucinations and unfaithful reasoning. A prevalent strategy to mitigate these issues is the use of reflection, which refines responses through an iterative process. However, while promising, reflection heavily relies on high-quality external feedback and requires iterative multi-agent inference processes, thus hindering its practical application. In this paper, we propose Meta-Reflection, a novel feedback-free reflection mechanism that necessitates only a single inference pass without external feedback. Motivated by the human ability to remember and retrieve reflections from past experiences when encountering similar problems, Meta-Reflection integrates reflective insights into a codebook, allowing the historical insights to be stored, retrieved, and used to guide LLMs in problem-solving. To thoroughly investigate and evaluate the practicality of Meta-Reflection in real-world scenarios, we introduce an industrial e-commerce benchmark named E-commerce Customer Intent Detection (ECID). Extensive experiments conducted on both public datasets and the ECID benchmark highlight the effectiveness and efficiency of our proposed approach.

View on arXiv PDF

Similar