From Untestable to Testable: Metamorphic Testing in the Age of LLMs
This tackles the problem of unreliable AI testing for developers, but it appears incremental as it adapts an existing testing method to LLMs.
The paper addresses the challenge of testing software systems with integrated AI and LLM functionalities, where labeled ground truth is scarce, by proposing Metamorphic Testing as a solution that uses relations among test executions to create executable test oracles.
This article discusses the challenges of testing software systems with increasingly integrated AI and LLM functionalities. LLMs are powerful but unreliable, and labeled ground truth for testing rarely scales. Metamorphic Testing solves this by turning relations among multiple test executions into executable test oracles.