SE | AI | Sep 28, 2025

Navigating the Labyrinth: Path-Sensitive Unit Test Generation with Large Language Models

arXiv:2509.23812v2 | 2 citations | h-index: 9 | ASE
Originality: Incremental advance
AI Analysis

This paper addresses the time-consuming, error-prone nature of unit testing for software developers, and represents an incremental improvement over prior methods.

The paper tackles the problem of automating unit test generation for software quality assurance by introducing a path-sensitive framework, JUnitGenie, which improves branch and line coverage by 29.60% and 31.00% on average over existing baselines.

Unit testing is essential for software quality assurance, yet writing and maintaining tests remains time-consuming and error-prone. To address this challenge, researchers have proposed various techniques for automating unit test generation, including traditional heuristic-based methods and more recent approaches that leverage large language models (LLMs). However, these existing approaches are inherently path-insensitive because they rely on fixed heuristics or limited contextual information and fail to reason about deep control-flow structures. As a result, they often struggle to achieve adequate coverage, particularly for deep or complex execution paths. In this work, we present a path-sensitive framework, JUnitGenie, to fill this gap by combining code knowledge with the semantic capabilities of LLMs in guiding context-aware unit test generation. After extracting code knowledge from Java projects, JUnitGenie distills this knowledge into structured prompts to guide the generation of high-coverage unit tests. We evaluate JUnitGenie on 2,258 complex focal methods from ten real-world Java projects. The results show that JUnitGenie generates valid tests and improves branch and line coverage by 29.60% and 31.00% on average over both heuristic and LLM-based baselines. We further demonstrate that the generated test cases can uncover real-world bugs, which were later confirmed and fixed by developers.
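To make "path-sensitive" concrete, here is a minimal Java sketch of the general idea: list the execution paths of a focal method as sets of branch conditions, then turn each path into a structured prompt that asks for a test covering exactly that path. It is not JUnitGenie's implementation. The class `PathPromptSketch`, the focal method `classify`, the hand-written path list, and the prompt wording are all hypothetical, and a real tool would derive the paths from control-flow analysis rather than hard-coding them.

```java
import java.util.ArrayList;
import java.util.List;

// Illustrative sketch only: every name here is an assumption, not part of JUnitGenie.
final class PathPromptSketch {

    /** Hypothetical focal method with a nested branch that a path-insensitive generator may miss. */
    static String classify(int age, boolean member) {
        if (age >= 18) {
            if (member) {
                return "adult-member";
            }
            return "adult-guest";
        }
        return "minor";
    }

    /** One execution path: the branch conditions that must hold, plus the value returned on it. */
    static final class Path {
        final List<String> conditions;
        final String expected;

        Path(List<String> conditions, String expected) {
            this.conditions = conditions;
            this.expected = expected;
        }
    }

    /** Hand-written paths of classify; assumed here, where a real tool would compute them. */
    static List<Path> pathsOfClassify() {
        List<Path> paths = new ArrayList<>();
        paths.add(new Path(List.of("age >= 18", "member == true"), "adult-member"));
        paths.add(new Path(List.of("age >= 18", "member == false"), "adult-guest"));
        paths.add(new Path(List.of("age < 18"), "minor"));
        return paths;
    }

    /** Builds one structured prompt that targets a single path of the focal method. */
    static String buildPrompt(String signature, Path path) {
        StringBuilder sb = new StringBuilder();
        sb.append("Focal method: ").append(signature).append('\n');
        sb.append("Target path conditions:\n");
        for (String condition : path.conditions) {
            sb.append("  - ").append(condition).append('\n');
        }
        sb.append("Expected return value: ").append(path.expected).append('\n');
        sb.append("Write one JUnit test whose inputs satisfy every condition above.");
        return sb.toString();
    }

    public static void main(String[] args) {
        // Print one prompt per path, so each branch combination gets its own test request.
        for (Path path : pathsOfClassify()) {
            System.out.println(buildPrompt("String classify(int age, boolean member)", path));
            System.out.println();
        }
    }
}
```

Because each prompt names the conditions of a single path, the nested `member` branch gets its own test request instead of depending on whichever inputs a generator happens to choose.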

