LGCLApr 25

Evolve: A Persistent Knowledge Lifecycle for Small Language Models

arXiv:2604.234247.5
Predicted impact top 81% in LG · last 90 daysOriginality Incremental advance
AI Analysis

For practitioners needing accurate small models without constant teacher calls, Evolve offers a practical knowledge lifecycle that amortizes teacher costs and improves accuracy significantly.

Evolve pairs a small 2B-parameter local language model with a persistent, teacher-compiled knowledge store, achieving 60-84% accuracy (+40-52pp over baseline) across 750 queries while reducing teacher invocations by over 50% through knowledge reuse.

Evolve pairs a small local language model with a persistent, teacher-compiled knowledge store -- refined through sleep consolidation and usage-driven refresh -- to deliver substantial accuracy gains over the model's parametric baseline while amortizing teacher costs through cross-query knowledge reuse. Rather than retrieving document fragments at query time, Evolve constructs a store of semantically coherent sections compiled by teacher models at natural conceptual boundaries; new sections are staged on acquisition, consolidated offline through teacher-mediated merging, and refreshed inline when expired. A 2B-parameter local model handles classification and generation; large teacher models are invoked only for knowledge operations. Across 750 benchmark queries spanning custom specialist questions, NaturalQuestions, and TriviaQA, the 2B model augmented by Evolve improves from 20-33% baseline accuracy to 60-84% (+40-52pp) while reducing teacher invocations by over 50% through reuse. Post-consolidation compresses the knowledge store by 31-33.5% across three independent benchmarks while preserving accuracy; section-based retrieval outperforms chunk-based retrieval by 5-9pp across every lifecycle condition. The architecture supports two generation modes over the same lifecycle -- suppress (strict section-only grounding, auditable) and augment (section-supplemented responses).

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes