CLAIDec 22, 2025

Auto-Prompting with Retrieval Guidance for Frame Detection in Logistics

arXiv:2512.19247v1h-index: 3
Originality Incremental advance
AI Analysis

This work addresses the challenge of adapting large language models to domain-specific NLP tasks like logistics text annotation without extensive fine-tuning, offering a scalable alternative for practical deployment.

The paper tackles the problem of frame detection in logistics texts by proposing a novel prompt optimization pipeline that combines retrieval-augmented generation, few-shot prompting, chain-of-thought reasoning, and automatic CoT synthesis, resulting in up to 15% improvement in real-world inference accuracy compared to baseline prompts.

Prompt engineering plays a critical role in adapting large language models (LLMs) to complex reasoning and labeling tasks without the need for extensive fine-tuning. In this paper, we propose a novel prompt optimization pipeline for frame detection in logistics texts, combining retrieval-augmented generation (RAG), few-shot prompting, chain-of-thought (CoT) reasoning, and automatic CoT synthesis (Auto-CoT) to generate highly effective task-specific prompts. Central to our approach is an LLM-based prompt optimizer agent that iteratively refines the prompts using retrieved examples, performance feedback, and internal self-evaluation. Our framework is evaluated on a real-world logistics text annotation task, where reasoning accuracy and labeling efficiency are critical. Experimental results show that the optimized prompts - particularly those enhanced via Auto-CoT and RAG - improve real-world inference accuracy by up to 15% compared to baseline zero-shot or static prompts. The system demonstrates consistent improvements across multiple LLMs, including GPT-4o, Qwen 2.5 (72B), and LLaMA 3.1 (70B), validating its generalizability and practical value. These findings suggest that structured prompt optimization is a viable alternative to full fine-tuning, offering scalable solutions for deploying LLMs in domain-specific NLP applications such as logistics.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes