DCApr 22

A Cloud-Native Architecture for Human-in-Control LLM-Assisted OpenSearch in Investigative Settings

arXiv:2604.211254.8h-index: 5
Predicted impact top 73% in DC · last 90 daysOriginality Synthesis-oriented
AI Analysis

For investigators handling large volumes of unstructured evidence, this work provides an architectural baseline for secure, human-in-control LLM-assisted search, though it is an incremental design study without empirical results.

The paper presents a cloud-native architecture for integrating LLMs into investigative search workflows, translating natural language queries into OpenSearch DSL. A functional prototype demonstrates feasibility, with evaluation planned using the Enron Email Dataset.

Complex criminal investigations are often hindered by large volumes of unstructured evidence and by the semantic gap between natural language investigative intent and technical search logic. To address this challenge, we present a design and feasibility study of a cloud-native microservice architecture tailored to private-cloud deployments, contributing to research in secure cloud computing and leveraging modern cloud paradigms under high security and scalability requirements. The proposed system integrates Large Language Models into a "Human-in-Control" workflow that translates natural-language queries into syntactically valid OpenSearch Domain-Specific Language expressions. We describe the implementation of a hybrid retrieval strategy within OpenSearch that combines BM25-based lexical search with nested semantic vector embeddings. The paper focuses on system design and preliminary functional validation, establishing an architectural baseline for future empirical evaluation. Technical feasibility is demonstrated through a functional prototype, and a rigorous evaluation methodology is outlined using the Enron Email Dataset as a structural proxy for restricted investigative corpora.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes