CLAIOct 1, 2025

Rationale-Augmented Retrieval with Constrained LLM Re-Ranking for Task Discovery

arXiv:2510.05131v1
Originality Incremental advance
AI Analysis

This addresses task discovery challenges for staff in Head Start programs using GoEngage, but it is incremental as it builds on existing retrieval methods.

The paper tackles the problem of staff struggling to find tasks on the GoEngage platform due to jargon and search limitations by proposing a hybrid semantic search system, achieving improvements in retrieval metrics like Hit@K and Precision@K.

Head Start programs utilizing GoEngage face significant challenges when new or rotating staff attempt to locate appropriate Tasks (modules) on the platform homepage. These difficulties arise from domain-specific jargon (e.g., IFPA, DRDP), system-specific nomenclature (e.g., Application Pool), and the inherent limitations of lexical search in handling typos and varied word ordering. We propose a pragmatic hybrid semantic search system that synergistically combines lightweight typo-tolerant lexical retrieval, embedding-based vector similarity, and constrained large language model (LLM) re-ranking. Our approach leverages the organization's existing Task Repository and Knowledge Base infrastructure while ensuring trustworthiness through low false-positive rates, evolvability to accommodate terminological changes, and economic efficiency via intelligent caching, shortlist generation, and graceful degradation mechanisms. We provide a comprehensive framework detailing required resources, a phased implementation strategy with concrete milestones, an offline evaluation protocol utilizing curated test cases (Hit@K, Precision@K, Recall@K, MRR), and an online measurement methodology incorporating query success metrics, zero-result rates, and dwell-time proxies.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes