AIARJun 13, 2025

PRO-V: An Efficient Program Generation Multi-Agent System for Automatic RTL Verification

arXiv:2506.12200v13 citationsh-index: 6Has Code
Originality Incremental advance
AI Analysis

This addresses the challenge of reducing cost and effort in hardware verification for engineers, though it appears incremental by building on existing LLM capabilities.

The paper tackles the problem of generating correct testbenches for Register Transfer Level (RTL) verification using LLMs, proposing PRO-V, a multi-agent system that achieves 87.17% verification accuracy on golden RTL implementations and 76.28% on RTL mutants.

LLM-assisted hardware verification is gaining substantial attention due to its potential to significantly reduce the cost and effort of crafting effective testbenches. It also serves as a critical enabler for LLM-aided end-to-end hardware language design. However, existing current LLMs often struggle with Register Transfer Level (RTL) code generation, resulting in testbenches that exhibit functional errors in Hardware Description Languages (HDL) logic. Motivated by the strong performance of LLMs in Python code generation under inference-time sampling strategies, and their promising capabilities as judge agents, we propose PRO-V a fully program generation multi-agent system for robust RTL verification. Pro-V incorporates an efficient best-of-n iterative sampling strategy to enhance the correctness of generated testbenches. Moreover, it introduces an LLM-as-a-judge aid validation framework featuring an automated prompt generation pipeline. By converting rule-based static analysis from the compiler into natural language through in-context learning, this pipeline enables LLMs to assist the compiler in determining whether verification failures stem from errors in the RTL design or the testbench. PRO-V attains a verification accuracy of 87.17% on golden RTL implementations and 76.28% on RTL mutants. Our code is open-sourced at https://github.com/stable-lab/Pro-V.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes