AIFeb 18, 2025

Interactive Agents to Overcome Ambiguity in Software Engineering

AI2CMU
arXiv:2502.13069v118 citationsh-index: 49
Originality Synthesis-oriented
AI Analysis

This addresses safety and efficiency issues for AI systems in software engineering, but it is incremental as it structures evaluation rather than proposing a new solution.

The study tackled the problem of AI agents making unwarranted assumptions from ambiguous user instructions in software engineering, finding that while models struggle to detect ambiguity, interactive questioning significantly improves performance.

AI agents are increasingly being deployed to automate tasks, often based on ambiguous and underspecified user instructions. Making unwarranted assumptions and failing to ask clarifying questions can lead to suboptimal outcomes, safety risks due to tool misuse, and wasted computational resources. In this work, we study the ability of LLM agents to handle ambiguous instructions in interactive code generation settings by evaluating proprietary and open-weight models on their performance across three key steps: (a) leveraging interactivity to improve performance in ambiguous scenarios, (b) detecting ambiguity, and (c) asking targeted questions. Our findings reveal that models struggle to distinguish between well-specified and underspecified instructions. However, when models interact for underspecified inputs, they effectively obtain vital information from the user, leading to significant improvements in performance and underscoring the value of effective interaction. Our study highlights critical gaps in how current state-of-the-art models handle ambiguity in complex software engineering tasks and structures the evaluation into distinct steps to enable targeted improvements.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes