SEFLMay 2

ClarifySTL: An Interactive LLM Agent Framework for STL Transformation through Requirements Clarification

arXiv:2605.0120966.7h-index: 2
AI Analysis

For developers of cyber-physical systems, ClarifySTL addresses the challenge of ambiguous natural language requirements in STL specification, though the reported improvements are qualitative rather than quantitative.

ClarifySTL is an interactive LLM-agent framework that transforms natural language requirements into Signal Temporal Logic (STL) specifications by first detecting and clarifying vague or ambiguous information. Evaluated on DeepSTL, STL-DivEn, and a new AmbiEval benchmark, it effectively improves STL transformation accuracy.

Signal Temporal Logic (STL) is a formal language for specifying real-time behaviors of cyber-physical systems (CPS). Automatically transforming natural language requirements into STL specifications has received growing attention. Recent efforts leveraging large language models (LLMs) have demonstrated impressive performance, but some natural language requirements in practice contain vague or ambiguous information, which remains challenging for LLMs to handle. To address these challenges, we propose ClarifySTL, an interactive LLM-agent framework that enhances STL transformation through requirements clarification. ClarifySTL first detects vague expressions that indicate underspecified information in a requirement. If any vagueness is detected, it generates targeted clarification queries to guide users in supplementing the requirement until all necessary details are provided. Subsequently, if ClarifySTL detects ambiguities, it formulates focused ambiguity clarification queries and updates the requirements based on user feedback until all ambiguities are resolved. Finally, the requirements with vagueness and ambiguity clarified are transformed into STL specifications using LLMs. This interactive framework ensures that the resulting STL formulas faithfully capture user intent while reducing the burden on the user. We evaluate ClarifySTL on the representative benchmarks DeepSTL and STL-DivEn, as well as our newly introduced AmbiEval benchmark, which is specifically designed to assess the performance of the agents in handling vagueness and ambiguity, including both detection and query generation. The experimental results show that ClarifySTL is effective.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes